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SYSTEM AND METHOD FOR TRANSCODING INFORMATION FOR AN AUDIO OR 
LIMITED DISPLAY USER INTERFACE 

Inventors: Zhi Wang and Jian Zhang 

5 This application is a continuation-in-part application of, and claims priority from, 

U.S. Patent Application Serial Number 09/706,898 filed November 6, 2000 by Wang, et al, which is a 
continuation in part of U.S. Patent Application Serial Number 09/574,990 by Wang, et al., filed May 
19, 2000, each incorporated herein by this reference. 

10 BACKGROUND OF THE INVENTION 

The present invention relates to a communication system that transcodes information 
prepared for a large visual display into information to be used with a small visual display or to be used 
in an audio format, for example, as on a telephone. 

Via the World- Wide- Web or the Internet, a user has access to a vast variety of 

15 information including, for example, weather forecasting, traffic reports, movie listings, news articles, 
and stock prices. A desk-top personal computer with a conventional desk-top browsing program may 
communicate with the Internet, be directed to search for information (typically apportioned into so- 
called pages), display listed search results, permit selection from the list, and display text, pictures, 
multimedia presentations (audio, animations, movies), and links to other pages. A typical page is 

20 prepared by intermixing with the information to be displayed, various symbols of a markup language 
that direct, among other things, the relative sizes of items to display, color, and relative position and 
of the information. The markup language is read by the browsing program so that the presentation 
better utilizes the particular display of the personal computer, for example by wrapping large amounts 
of text within suitable margins. 

25 Recently, portable devices have been introduced that have the ability to communicate 

via the Internet, operate like a telephone^ and provide a relatively small display area and relatively 
small keyboard. Although these devices may have a browse process analogous to the desk-top 
browsing process discussed above, without the present invention, there is no convenient way to 
accomplish presentation of information received from the Internet because such information was not 

30 designed to be presented on a small display or in audio. For example, if the information is to be 
presented entirely in audio, simply reciting the words from left to right as they might appear on a 
desk-top personal computer display is likely to become tedious to listen to and confusing to the 
listener. Further, there is no convenient way for the user to make selections, to input requested data 
into a form, and to follow hypertext references to other pages (herein called links). It is desirable to 

35 perform these functions from a common telephone, i.e., a device having only audio input and output 
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capability. 

Text and screen readers are known, and such readers have been used to provide 
auditory presentations of textual information through speech synthesis. However, because the content 
received from the Internet is received as a mixture of text and markup language symbols, reciting the 

5 content received would be unintelligible to the average user who is unfamiliar with markup language 
symbols. Because the markup symbols define the visual arrangement of text on a monitor screen, 
simply omitting the markup symbols destroys relationships among the elements of information, 
thereby confusing an audio presentation of the information. 

Although the Internet is primarily designed for visual-based access and navigation 

10 (e.g., searching and following links), there is a growing interest in access to information organized for 
presentation in either an auditory or a primarily auditory with visual format. For example, users may 
include the visually impaired and individuals who, by preference, timing, location, or proximity to 
equipment, desire to access the Internet over a common telephone. Thus, it is desirable to facilitate a 
more understandable and controllable experience for individuals who access the content of the 

1 5 Internet through equipment adapted for either an auditory or a primarily auditory with visual format. 

To meet demand for pages suitable for presentation on a small display or in audio as 
derived from pages originally prepared for presentation on a desk-top PC, it is desirable to automate a 
method of code transformation. Unfortunately, conventional methods of code transformation may be 
expensive to implement on a large scale due in part to the need to overcome structural and functional 

20 deficiencies with special purpose programming code. For example, when the input page is compliant 
with Extensible Markup Language (XML), transformations to accomplish various display styles 
suitable for presentation on a desk-top PC may be accomplished using the JAVA programming 
language and/or using an application program interface, such as, Extensible Stylesheet Language 
Transforms (XSLT). It is therefore desirable to provide automatic transformations with less 

25 programming and debugging time and with capabilities not included in XSLT. Preferably, the nature 
of the transformations should be readily apparent from reading a text file and be applicable to pages 
that are subject to frequent change. Descriptions of XML and XSLT are available from W3C, for 
example, at Internet web sites <http.7/www.w3.org/TR/xslt> and <http://vmw.w3.org/TR/REC-xml>. 

Pages made available by an author for browsing via the World- Wide- Web or the 

30 Internet are subject to change at any time by the author. Changes may be desired to assure that the 
information presentation is appealing, accurate, complete, and helpful, to name a few popular reasons. 
Changes may be made to the content of a page (e.g., text, graphics, and links to other pages), the 
structure of the page (e.g., how the information is organized for presentation), and the position of the 
page in a series, network, or hierarchy of pages. It is desirable to be able to make reliable reference to 

35 
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information presented on such an often revised page from an automatic transformation method or 
from another page (e.g., a summary page). 

Without the systems and methods taught herein, some desirable forms of accessing 
information from the World- Wide- Web and the Internet will remain impractical initially and 
5 impractical to support over time in view of frequent changes to the content, the structure, and the 
organization of the information. Consequently, the value of convenient access to a vast variety of 
information may be diminished or never fully realized. 

SUMMARY OF THE INVENTION 

10 A system for voice browsing or transcoding permits access and navigation to 

information via an audio user interface or a limited display user interface. The audio user interface 
permits speech or telephone keypad input to interrupt the presentation of information, for example, to 
direct following a hypertext link. When operating with the Internet, the user of one or more access 
devices including a cell phone, telephone, portable computer, or workstation may request a web page 

15 by providing an address. Guidance may be prepared ahead of time by the user operating an editor. 
Guidance may direct summarization, selection, annotation, and restatement of the requested content. 
Guidance may include statements of a markup language that include a structural summary description 
of the content or a hidden Markov model. To allow for frequent changes in the arrangement of 
content on web pages provided by the Internet, portions of the content are identified for the 

20 application of particular guidance by aligning a structural summary description of the current content 
to the structural summary description stored with the guidance. Guidance suitable for particular 
content may be located on the basis of an address of the content (e.g., a URL), of a related address, or 
of being within the scope of a regular expression. Guidance produces derivative content which may 
be easier to understand when presented in audio or easier to understand when presented on a display 

25 of limited area or resolution. Derivative content may include a set of web pages hierarchically 

organized with connecting links. The complexity and importance of regions of the requested web 
page are assessed in developing the hierarchy and organization of the set of web pages to be 
presented. Each web page may then be expressed in a voice mark up language and presented through 
a speech engine to the user. Information from the requested web page may be presented on the same 

30 or a different access device, 

A presentation by a first user interface may include a link for transferring control to a 
second user interface. For example, the user of a limited display device that includes a telephone 
(e.g., a so-called web phone) may follow a link on the limited display for terminating the current 
session of a limited display user interface, placing a telephone call, and establishing a session with an 

35 audio user interface. 
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According to various aspects of systems and methods of the present invention, 
guidance for transcoding may be prepared, maintained, and employed in a markup language having 
semantics for expressing at least one of procedural, declarative, input, and output operations. By 
including semantics for both procedural and declarative operations, particular guidance may have 
5 wide application to various sets of pages that are subject to frequent change, and be succinct for ready 
comprehension by a programmer. 

According to various additional aspects of systems and methods of the present 
invention, software that includes a process for transcoding may be structurally simplified by 
employing similar parsing functions for analyzing content to be transcoded, for analyzing guidance 

10 that directs such analyzing and transcoding, and for parsing system inputs and user inputs related to 
such analyzing and/or transcoding operations. 

A method, for providing derivative content from provided primary content, including: 
(a) parsing guidance in accordance with a markup language; and (b) providing derivative content in 
accordance with the primary content and in accordance with any number of nested subroutines 

15 declared in the guidance by nested elements of the markup language. Use of nested subroutines 
simplifies complex transcoding operations while maintaining human readability of the guidance. 

According to a first aspect of such a method, the guidance may include a first element 
(e.g., for a main routine) having a call, a declaration of a first subroutine, and a declaration of a 
second subroutine. The call may include a first element identifying the first subroutine, and a second 

20 element identifying the second subroutine, the second element being nested within the first element. 
Primary content may then be provided as directed by the parsed guidance in a manner including 
performing in a first execution context until encountering the call, performing the second subroutine 
in an execution context identified in the declaration of the second and the first subroutines, and 
returning to performance in the first execution context after the call. 

25 hi various implementations, according to the present invention, parsing of the 

guidance may be performed by a language processor to provide an output tree representation. 
Providing derivative content may be accomplished by traversing the output tree representation. 
Advantageously, some parsing operations used when parsing the primary content may be reused when 
parsing the guidance. Software that includes a process for transcoding may, consequently, be 

30 simplified, less extensive, less complex, and operate more efficiently. In one implementation 

according to various aspects of the present invention, a process for transcoding includes a language 
processor and a process for providing derivative content. 

In an alternate implementation, parsing of the guidance may be performed by a 
language processor to provide a description of derivative content. Providing derivative content may 

35 be accomplished in accordance with the description. The description may include references to 



4 



46806.00007 



primary content, to guidance, and to variables declared by the guidance. In one implementation, each 

reference is a pointer. By deriving content from a description, copying of content, constants, and 

variables to a separate output storage area may be avoided. 

In yet another implementation, a representation of content (e.g., primary content) is 
5 revised to include indicia of v^hat portions of the representation are to be included in derivative 

content. The representation of content serves as a description of derivative content. Copying portions 

of content to a separate output storage area may be avoided. 

Another method for providing derivative guidance in accordance v^^ith various aspects 

of the present invention includes (a) parsing guidance in accordance with a markup language having 
1 0 elements; and (b) providing derivative content in accordance vs^ith a tree representation of primary 

content and content referenced from an element of the guidance. By using a tree representation of 

derivative content, operations on referenced content are simplified. 

In various implementations, guidance may be parsed to provide a guidance tree 

including elements each of which make reference to a respective uniform resource identifier (URI) for 
15 content. Primary content may be parsed to form a content tree. Referenced content may be parsed to 

form a subtree. Derivative content may be provided in accordance with the content tree and the 

subtree. 

The subtree may be inserted at a node of the content tree, thereby revising the content 

tree. Providing derivative content may proceed by traversing the revised content tree. 
20 A reference may be identified by an element of guidance that is nested in another 

element. Elements that provide a reference may include one or more subroutine elements. An 

element may alter the context for determining the target of a reference (e.g., the code to be processed 

on encountering a subroutine name, the value to be processed on encountering a variable name). 

When an <include> or <moduie> element refers to a URI and the data addressed by the URI includes 
25 further guidance (e.g., processes, subroutines, variables, or other content), references may resolve to 

targets in the data addressed by the URI. Target content and target processes may be referenced with 

user-defined names (e.g., subroutine element tags). 

Guidance may be expressed as elements of a markup language each having a tag. 

Guidance may include as an element the declaration of a subroutine having a user-defined tag, 
30 Guidance may include as an element the declaration of a variable capable of representing a tree. A 

tree variable advantageously may be used to represent complex data types including data structures, 

arrays, and stacks. 

BRIEF DESCRIPTION OF THE DRAWING 
35 Embodiments of the present invention will now be further described with reference to 
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the drawing, wherein like designations denote like elements, and wherein: 

FIG, 1 is a functional block diagram of a system according to various aspects of the 
present invention; 

FIG. 2 is a message sequence diagram in an exemplary implementation for providing 
5 an audio user interface in the system of FIG. 1; 

FIG. 3 is a flow diagram for a method for supporting various user interface functions 
of the system of FIG. 1; 

FIG. 4 is a data flow diagram of a process for preparing derivative content from 
primary content and guidance according to various aspects of the present invention; 
10 FIG. 5 A is a flow diagram for a method for finding guidance performed in the process 

of FIG. 4; 

FIG. 5B is a flow diagram for a method for reducing the complexity of content to be 
presented as performed in the process of FIG. 4; 

FIG. 6 is a flow diagram for presenting derivative content in the method of FIG. 3; 
15 FIG. 7 is a message sequence diagram in an exemplary implementation for supporting 

a limited display device in the system of FIG, 1; 

FIG. 8 is a message sequence diagram in an exemplary implementation for providing 
a guidance editor in the system of FIG. 1; 

FIG. 9 is a screen layout for a graphical user interface for editing guidance according 



20 to FIG. 8; 

been directed; 



FIG. 1 OA is a portion of the screen layout of FIG. 9 after a revision to guidance has 



FIG. 1 OB is a schematic presentation of derived content on a limited display device; 

FIG. 1 1 is a message sequence diagram in an exemplary implementation of a method 
25 for supporting a device having both an audio device and a limited display device; 

FIG. 12 is a flow diagram for applying guidance according to various aspects of the 
present invention; and 

FIG. 13 is a data flow diagram of a process for transcoding according to various 
aspects of the present invention. 

30 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
The present invention provides, inter alia, a system for accessing and presenting 
information. The information may be accessed via a computer network (e.g., a local area network, an 
intranet, or the Internet). Information may be accessed using a common telephone or a computer 
35 workstation. In addition to receiving an audio presentation of information, the user may request other 
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information (e.g., navigate the Internet) by responding to words spoken by one or more voices (e.g., 
such as a female voice and a male voice). To request information, the user may repeat a word, actuate 
a button on the telephone, or speak a command. The response may specify a link to follow or specify 
information to provide to a process or server on the network. 
5 A system according to various aspects of the present invention may include processes 

for transcoding, voice browsing, and text-to-speech software for reciting information in audio for the 
user to listen to. Transcoding is generally a process that acts upon information expressed in a markup 
language (e.g.. Hypertext Markup Language (HTML), Extensible Markup Language (XML), and 
adaptations of these for voice and wireless systems such as Wireless Markup Language (WML) and 

10 Voice Extensible Markup Language (VoiceXML)). Liformation (e.g., content) that is expressed in a 
suitable markup language may be represented as a tree having nodes. Nesting of elements of the 
markup may be represented as parent-child relationships among the nodes and as subtrees. Each node 
may have a portion of the information that is expressed in the markup language. A common 
presentation of the information represented in a tree may be developed by traversing the nodes of the 

1 5 tree (e.g., in a depth-first manner). For example, a result of transcoding may be input to a text-to- 
speech process to be recited in audio. Voice browsing may respond to any of various inputs from the 
user including the user's spoken words and key actuation on a keyboard or telephone dialing keypad. 
Browsing generally includes following a link to a desired page. 

In operation a system according to various aspects of the present invention may 

20 provide a user with: (a) access to information such as news headlines, financial information, weather 
information, etc., in a manner that is personalized and substantively controlled by the user; and (b) a 
substantially "audible only" version of such information. Such a system permits the user to retrieve 
customized news and other information. Further, access is provided to user-requested information 
(e.g., web pages) that have not been edited by the user. For example, information received from the 

25 Internet (e.g., web pages in HTML) may be transcoded by application of guidance to provide selected 
information in VoiceXML for recitation to the user via a speaker or telephone. 

A service operative in conjunction with the Internet and provided according to 
various aspects of the present invention may provide, inter alia, a personal page for each customer 
and a transcoding service to transcode a conventional visually-friendly HTML web page to an aurally- 

30 friendly set of pages. Each customer may specify a personal page to include, for example, status of e- 
mail, a calendar, a news report, a sports report, a weather report, a traffic report, and movie listings. 

Any information in a hypertext or markup language (e.g., an Internet web page) may 
be transcoded according to various aspects of the present invention. Transcoding may include 
applying directives supplied in a file herein called guidance. Guidance may be created by the user for 

35 each web page of interest, thereby facilitating listening preferences. Guidance may specify the 
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manner in which the web page is rendered (e.g., visually and/or in audio), for example, by specifying 
features such as volume, rate, pitch, direction, suppressing output for specific elements, spelling out 
particular text letter by letter, speech fonts (male/female, adult/child, etc.), inserted text before and 
after an element of content, sound effects before, during, and after elements of content, and adding 
5 music or prerecorded sounds. 

A communication system, according to various aspects of the present invention, 
includes any plurality of computers (e.g., servers, workstations, and portable devices including 
telephones, and portable computers) coupled to a network for the exchange of information to any of 
the computers as a visual presentation and/or to any of the computers as an audio presentation. The 

10 system may provide, inter alia, a graphical user interface (GUI), an audio user interface, and a limited 
display user interface. For example, system 100 of FIGs. 1-13 includes access devices 110, gateway 
devices 120, a network 130, and network servers 140, Access devices 1 10 are coupled from time to 
time to gateways 120 via any conventional physical and logical protocols. Gateways 120 and network 
servers 140 (e.g., home site server 141 and any site server 142) are coupled from time to time (or by 

15 dedicated links) to network 130 via any conventional networking technology (e.g., links 180 and 181 
respectively) including physical and logical protocols that may differ from the physical and logical 
protocols discussed above with reference to access devices 110, These protocols permit information 
to be transferred effectively simultaneously between any one or more gateways and servers. Each 
fimctional block shown in FIG. 1 is representative of any number of like functional blocks that may 

20 operate independently (e.g., cell phones operated by independent users), operate in tandem (e.g., 
numerous transcoder gateways cooperating from common input and output queues), or operate at 
different geographical locations or for different special purposes (e.g., wireless gateways for different 
protocols or home site servers at different corporate headquarters). 

Access devices include any device that facilitates access to information that is stored 

25 on a server coupled to a network. Access devices 1 10, for example, comprise an unrelated set of a 
wide variety of equipment operated independently by many users. Representative devices include 
workstation 111 (e.g., a desk-top computer), wireless device 112 (e.g., a laptop, palm-top, or network 
appliance), telephone 113, and cell phone 114 (e.g., any wireless telephone or pager). Links shown 
between functional blocks represent any communication capability including for example, radio, 

30 infrared, and wired networks including cable TV, local area networks, the public switched telephone 
network, and proprietary dedicated lines. 

Workstation 111 may be coupled to network 130 through Internet service provider 
(ISP) gateway 124 by links 170 and 179. Wireless device 112 may be coupled to network 130 via a 
wireless link 171 to wireless gateway 121, link 172 to ISP gateway 124, and link 179. Wireless 

35 device 112 may alternatively be coupled to network 130 via wireless link 171 to wireless gateway 
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121, link 174 to transcoder proxy server 125, and link 178. Telephone 113 may be coupled to 
network 130 via wired link 173 to voice browser server 123, link 177 to transcoder proxy server 125, 
and link 178. Cell phone 1 14 may be coupled to network 130 via wireless link 175, cellular gateway 

122, link 176 to voice browser server 123, link 177, transcoder proxy server 125, and link 178. 

Any conventional protocols may be used for communication among the functional 
blocks shown for system 100. Each logical and physical protocol for each link discussed above is 
implemented with a portion of the protocol functions at the ends of the link. Suitable protocol 
functions are described in Table 1, below. Gateways 120 may cooperate with access devices 1 10 to 
support connection service or connectionless service. 



TABLE 1 



Access Device I/O Functions 



Workstation 1 1 1 receives data (e.g., messages and 
files) for presentation to the user. At a minimum, 
workstation 111 presents received data through a 
conventional browse process (e.g., Litemet Explorer 
marketed by Microsoft) for a visual presentation 
(e.g., text, graphics, animations, or video) on a 
monitor. Workstation 111 may include speakers, 
convert data to analog audio, and drive the speakers. 
When link 170 is a LAN or cable TV carrier, 
workstation 111 may operate with data, video, or 
television signals. Data received may be in a 
markup language (e.g., HTML or XML) interpreted 
by the browse process. Workstation 111 sends data 
that originates as user keystrokes on a keyboard, user 
operation of a pointing device (e.g., mouse 
movements and mouse clicks), speech into a 
microphone, data from a file, or video from a 
camera. A browse process may convert such inputs 
into requests for information (e.g., a URL with 
arguments of the type described in RFC 1738) or 
into messages (e.g., email, attachments to email, or 
HTTP post commands). 



35 



Gateway Protocol Functions 



ISP gateway 124 provides messages (e.g., 
TCP/IP) on link 170 that may be interpreted 
by a browse process in workstation 1 1 1 as 
data in a hypertext or markup language (e.g., 
HTML or XML). The manner of presentation 
may be effected by in-line style specifications 
or by references to other files (e.g., cascading 
style sheets in HTML or extended style sheet 
language (XSL) in XML). Style 
specifications may direct text and graphics 
and/or direct audio, video, and TV 
presentation. Style specifications may be 
retrieved from any server 140, any gateway 
120, or maintained on workstation 111. 
Gateway 124 may accomplish protocol 
conversion between link 170 (e.g., HTML, 
TELNET, UDP) and link 179 (e.g., XML, 
TCP/IP). Gateway 124 provides message 
routing between workstation 111 and any 
server (e.g., servers 140 and transcoder proxy 
server 125). 
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Access Device I/O Functions 


Gateway Protocol Functions 




Wireless device 112 may support link 171 as a 


Wireless gateway 121 may support link 171 as 




digital connection oriented path or as a 


a digital coimection oriented path or a 




connectionless path for data exchange. Data 


connectionless path; and, may support 


5 


received may be presented to a user on screen (e.g., 


connection maintenance (e.g., monitor link 




laptop or palmtop) as discussed above v^ith reference 


quality, retransmit, use alternate channel, 




to workstation 111, except that such a screen on a 


automatic fall-back, or re-establish link) if the 




wireless device may have less visible area and/or 


connection becomes unreliable. Gateway 121 




lower resolution. Wireless device 1 12 may include a 


may support file transfer and remote 


10 


browse process similar in some respects to the 


configuration capabilities. Gateway 121 




browse process discussed above with reference to 


provides messages (e.g., TCP/IP) on link 171 




workstation 111. Wireless device 1 12 may include 


that may be interpreted by a browse process in 




an alphanumeric keyboard (or pen) and a pointing 


wireless device 1 12 as data in a hypertext or 




device (mouse or pen) to provide user input 


markup language (e.g., HTML or XML). ISP 




functions similar to workstation 111 discussed 


gateway 124 may therefore provide link 172 




above. Also, a microphone may be included with 


in an analogous manner to link 170 discussed 




functions described with reference to workstation 


with reference to workstation 111. Wireless 




111. Wireless device 1 12 may be programmable 


gateway 121 may accomplish protocol 




(e.g., by download from Internet 130) and may be 


conversion between links 171 (e.g., WML, 


S:20 


used generally as a telephone or as a workstation. 


WAP) and 172, 174 (e.g., XML, HTTP). 




Telephone 1 13 receives analog audio signals (e.g., 


Voice browser server 123 samples analog 




dial tone, computer generated speech) and sends 


audio to (a) detect firequencies of DTMF and 




analog audio signals (e.g., user's voice, dialing 


identify the corresponding key press/release as 




tones). Telephone 1 13 provides an audio user 


data (e.g., a message alerting to an interface 


25 


interface that includes a speaker that the user listens 


event, or one or more time-stamped records); 




to, a microphone that detects the user's speech, and a 


and (b) to capture user's speech as data (e.g.. 




keypad the user operates to generate dual-tone multi- 


streaming audio, messages containing sample 




frequency (DTMF) dialing signals. 


data, or a file of sample data). Voice browser 
server 123 provides data (spoken or keyed) to 


30 




network 130 via link 177, transcoder proxy 
server 125, and link 178 (e.g., using HTTP 
and TCP/EP). Voice browser server 123 also 
implements an audio user interface according 
to various aspects of the present invention as 


35 




discussed below. 
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Access Device I/O Functions 


Gateway Protocol Functions 


Cell phone 1 14 may support link 175 as a digital 
connection oriented path for data exchange. Cell 
phone 1 14 provides a microphone and a keypad to 
receive user input as discussed above with reference 
to telephone 1 13, but user inputs may be converted 
to digital signals for transport on link 175. The 
keypad may have many more general purpose keys 
(e.g., alphanumeric) or special purpose keys (e.g., 
TDD service for the deaf). Cell phone 1 14 provides 
a speaker for audio output as discussed above with 
reference to telephone 1 13 but cell phone 1 14 may 
convert data received on link 175 to analog audio to 
drive the speaker. Cell phone 1 14 may have a 
screen similar in fiinction to the screen of wireless 
device 1 12, though typically much smaller with 
lower resolution and lower color visibility. 


Cellular gateway 122 may support link 175 as 
a digital connection oriented path and in 
addition support connection maintenance (e.g., 
monitor link quality, retransmit, use alternate 
channel, automatic fall-back, or re-establish 
link) if the connection becomes unreliable. 
Cell phone 1 14 may be capable of receiving 
data for storage on cell phone 1 14, in which 
case gateway 122 may support file transfer 
and remote configuration capabilities. 
Cellular gateway 122 may support protocols 
on link 176 that are identical to protocols used 
on link 173. Cellular gateway 122 may 
accomplish protocol conversion between links 
175 (e.g., proprietaiy protocols of various cell 
phone networks) and 176 (e.g., analog audio, 
or Wireless Application Protocol (WAP)). 



20 Gateways and servers typically comprise computers having storage capacity and/or 

special purpose hardware and software for efficient data access and network communication. For 
example, each functional block of gateways 120 and servers 140 may consist of multiple computers 
having disk storage subsystems, conmiunication interfaces (e.g., modems, radio transceivers) and 
network interfaces (e.g., T carrier) that may be compliant with industry standards and conventions 

25 adapted for use on the Internet. In an alternate implementation, workstation 1 1 1 comprises equipment 
similar to gateway and server equipment and vice versa. In other words, gateways, servers, and 
workstations may be distinguishable by the fimctions performed and these functions may be primarily 
dependent on the software that operates on each of these otherwise general purpose computers. 

Each of the computers of system 100 (including embedded computers in wireless 

30 devices, telephones, and cell phones) may include functions performed in software (e.g., including 

firmware) collectively referred to herein as processes. Processes may include an operating system and 
a set of application programs suitable for the functions being performed. Application programs may 
include hardware interface specific processes (e.g., drivers) as well as processes suitable for 
simultaneous use by several processes or threads. Processes may be integrated with the operating 

35 system or loaded and maintained separate from the operating system. Processes, mcluding the 
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operating system, may be implemented in firmware and for some functions in circuitry. 

A message sequence provided by processes operating in a system according to 
various aspects of the present invention may provide an audio user interface. An audio user interface, 
inter alia, permits user access to information stored on a network (e.g., navigation of the Internet by 

5 following hypertext links). For example, in message sequence 200 of FIG. 2, processes cooperate to 
receive a telephone call from a user of an audio device and to provide an Internet web page via a link 
identified by the user. For clarity of explanation of an audio user interface, the Internet and its 
conventional protocols perform the functions of network 130 and links 178-181. 

Audio device 202 represents any access device having audio input/output (I/O) 

10 capability and the ability to participate in an audio telephone call. Call process 204, performed by 
audio device 202, performs all conventional steps to initiate a telephone call to voice browser server 
123. Answer process 208, performed by voice browser server 123, performs all conventional steps to 
accept the call. After the call is established, call process 204 provides (step 242) identification (e.g., 
via DTMF signaling initiated by the user or automatically by call process 204) sufficient for answer 

15 process 208 to complete a conventional login scenario (step 244). Login process 212, performed by 

ISP server 124, receives and responds to the login scenario to establish a session on ISP server 124 for 
the user of audio device 202. Any conventional process may be used by call process 204, answer 
process 208, and login process 212, In an alternate implementation, a session is implied by system 
operation (e.g., dedicated links) and steps 242 and 244 are omitted. 

20 Processes 216-226 may be performed as shown by five servers in separate physical 

locations. In an alternate implementation, process 216-222 may be performed by a single server at 
one location. Independent servers are preferred for higher throughput for each of processes 216-222. 
Processes 212 and 222-226 may include conventional software, for example, the Apache web server 
marketed by the Apache Software Foundation. Processes 204 and 206 may be accomplished by circuitry 

25 of a conventional analog telephone. 

At a time after the session is established, (e.g., in response to interprocess messaging 
initiated by answer process 208) voice browse process 216, performed by voice browser server 123, 
requests information (step 246) expected to be provided by a server 140 on network 130. The initial 
request may correspond to a "home" page. The request may be made by addressing a message to 

30 transcode process 218 that identifies the requested page. In an Internet implementation, the request 
includes a Uniform Resource Locator (URL) with arguments (URLl at step 246) of the type described 
in RFC 1738 (Request For Comment available from the W3C). URLl may include indicia of 
identification of the address of transcoder proxy server 125; the address of ISP server 124; the address of 
home site server 141; the requested page from home site server 141; the address of voice browser server 

35 1 23 ; and the type of process 2 1 6 as an agent. 
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Transcode process 218, performed by transcoder proxy server 125, generally restates 
information from URLl to prepare URL2 (step 248). URL2 is a URL of the type described above that 
may include indicia of identification of the address of transcoder proxy server 125; the address of ISP 
server 124; the address of home site server 141; and the requested page from home site server 141, v^hich 
5 is sufficient to obtain the desired home page. 

Route process 222, performed by ISP server 124, performs conventional routing of 
requests. For example, route process receives URL2 (e.g., according to HTTP and TCP/IP protocols) and 
provides a request (substantially the same as URL2) to serve process 224 (step 250). 

Serve process 250, performed by home site server 141, performs conventional search, 
10 access, and response fiinctions to satisfy incoming requests. For an Internet implementation, serve 

process 224 provides a v^eb page (PAGEl) that includes information in a markup language (step 252) 
having tags. PAGEl is typically suitable for presentation on a workstation having a relatively large 
screen, color monitor. 

Route process 222 routes PAGEl on receipt from process 224 to transcode process 218 

1 5 (step 254). Route process 222 typically does not revise the information being routed. 

On receipt of PAGEl, transcode process 218 prepares a substitute page (PAGE2) and 
transmits the substitute page to voice browse process 216 (step 258). The operations necessary to 
produce PAGE2 are generally referred to as transcoding. PAGE2 may include information that has been 
summarized from the information of PAGEl, selected from the information of PAGEl, annotations, and 

20 restatements of the information of PAGEl . PAGE2 may be provided in any markup language. 

Preferably, transcode process 218 uses the identity of voice browse process 216 to determine a suitable 
markup language. For example, when voice browser server 123 performs multiple instances of more than 
one browse process and some browse processes accept HTML while others accept XML, transcode 
process 218 prepares PAGE2 in a markup language corresponding to indicia of identification received in 

25 URLl, as discussed above. 

Transcode process 218 summarizes information when the amount of information 
provided in step 252 exceeds limits of magnitude and/or complexity. The resulting summary may be 
hierarchical. For example, if PAGEl included 4 news stories, state and local weather, and stock market 
averages, a set of pages may be prepared of which PAGE2 is a member. PAGE2 may be prepared to 

30 include merely headlines of each of the news stories each followed by a link to the news story in frill; a 
link to state weather; a link to local weather; and a link to stock market averages. Other members of the 
set may include , for example, PAGE2.1 (a news story); PAGE2.5 (a state weather report); PAGE2.7 (a 
list of averages); PAGE2.7.1 (DOW volume, closing average, highest average, and lowest average); and 
PAGE2.7.2 (NASDAQ volume, closing average, highest average, and lowest average). 

35 Information may be selected from information received at step 252. In other words 
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information may be excluded from appearing in PAGE2 (or a lower level hierarchical member of the set, 
as discussed above). Such information may not be suitable for the user interface (e.g., not suitable for 
audio, or too large for the small display area) employed on device 202. For example, there may be no 
description of a bitmap element of PAGEl; or, there may be no description of links that appear in several 

5 places on PAGEl so as to avoid confiising the user with unnecessary redundancy. 

Annotations may be added before and/or after information elements received at step 
252. For example when PAGEl includes a daily change of Dow Jones Average stated as +139.4, 
PAGE2 may include information describing the number 139.4 as follows "an increase of 139.4 
points". The words "an increase of may replace the "+" and the word "points" may be added. 

10 Annotations improve perception of the information by the user of audio device 202. 

Restatements of information may include conventional transcoding operations, as 
well as operations desired for better presentation of information to the user of device 202, For 
example, when PAGEl includes an animation or marquee text, PAGE2 may be prepared to include a 
restatement of the text in full without regard to the original animated or scrolling presentation in 

15 PAGEl. Further, when PAGEl includes in-line scripts (e.g., style information or references to 

cascading style sheets) or program code information (e.g., reference to a JAVA applet, or a common 
gateway interface (CGI) command), transcode process 218 may prepare a portion of PAGE2 to 
represent a suitable substitute function that is compatible with voice browse process 216. 

Transcode process 218 is directed to perform summarization, selection, annotation, 

20 and restatement, inter alia, according to guidance prepared, stored, retrieved, and referenced in 
accordance with various aspects of the present invention. Guidance, represented for example as 
MASK database 230 in FIG. 2, differs from a conventional style sheet in several ways. A 
conventional style sheet (e.g., written in CSS for HTML or in XSL for XML) must be identified by a 
reference appearmg in the information (e.g., part of PAGEl). By contrast, PAGEl, according to 

25 various aspects of the present invention, need not contain any reference to guidance of database 230. 

A browse process receives content for presentation and refers to a conventional style sheet to interpret 
tags of the markup language of the received content. By contrast, guidance of database 230, directs a 
transcode process how to prepare derivative content to be input to a browse process. The transcode 
process receives content for transcoding in a first markup language and refers to guidance in a second 

30 markup language for directing how to prepare an output to a browse process in the same or a different 
markup language than the first markup language. Consequently, the second markup language, 
according to various aspects of the present invention, includes tags, that direct functions not possible 
or meaningful with respect to the use of a conventional style sheet. 

Transcode process 218, on receipt of indicia identifying the desired information to be 

35 obtained via ISP server 124 (e.g., URLl) or on receipt of indicia identifying the information received 
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from ISP server 124 (e.g., PAGEl title), retrieves suitable guidance from database 230 (steps 256- 
258). Summarization, selection, annotation, and restatement, as discussed above, are then 
accomplished based on the information of PAGEl to produce PAGE2. PAGE2 may conform to a 
voice markup language (e.g., VoiceXML or a proprietary voice markup language). Transcode process 
5 218 then provides PAGE2 to voice brow^se process 216 (step 260). 

Voice brovv^se process 216 may include a conventional text-to-speech (TTS) 
capability having an input queue and an analog audio output capability. For example, voice browse 
process 216 may include Microsoft SAPI marketed by Microsoft Corp. Voice brow^se process 216 
parses the incoming information sent by transcode process 21 8 in a markup language (e.g., PAGE2); 
10 substitutes and adds corresponding information according to conventional audio style sheets and other 
configuration settings; loads the TTS queue and provides an analog signal (RECITAL 1) to I/O 
process 266 of audio device 202 (step 262). 

Voice brov^se process 216 responds to DTMF signals from audio device 202 and/or 
speech originating w^ith the user of device 202 (step 264). Voice browse process 216 may include a 
115 conventional speech recognition (SR) capability having an analog audio input. Speech by the user 
t: may cause I/O process 206 to provide an analog audio signal SPEECHl to voice browse process 216. 

:j , For example, consider RECITAL 1 to have caused I/O process 206 to drive a speaker in audio device 

202 with the audio message "press or say 1 for local weather" whereupon the user responded saying 
"one" and I/O process 206 provides an analog audio signal (SPEECHl) corresponding to "one". 
n.20 Voice browse process 216 responds accordingly to determine a URL that provides local weather (step 

266) and sends the URL (along with the information described above with reference to step 246) 
^ (URLS ) to transcode process 218. 

Transcode process 218, in a manner analogous to operations performed as discussed 
above with reference to step 248, determines a suitable URL for local weather and sends (step 268) 
25 the URL to route process 222 (e.g., URL4). Route process 222, in a manner analogous to operations 
performed as discussed above with reference to step 250 sends (step 270) the URL (e.g., URL4) to 
serve process 226 of any site server 142. Server 142 responds with information (e.g., PAGES) in a 
markup language (step 272) which is sent via route process 222 without substantial revision to 
transcode process 218 (step 274). Transcode process 218, in a manner analogous to operations 
SO discussed above with reference to steps 256-260 prepares and sends information to voice browse 
process 216 (e.g., PAGE4) in a markup language (steps 276-280). Voice browse process 216, in a 
manner analogous to operations discussed above with reference to step 262, prepares and sends an 
analog audio signal (RECITAL2) to I/O process 206 (step 282) to convey to the user the information 
corresponding to the requested page from any site server 142 (e.g., state weather information as 
35 summarized, selected, annotated, and restated), 
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A method of providing a user interface according to various aspects of the present 
invention provides information adapted for audio presentation and/or adapted for a limited display. 
Such a user interface may accept commands in audio or as keystrokes for navigation to other 
information for presentation. Accordingly, a computer may perform method 300 of FIGs. 3-6, 12 and 
13 beginning by requesting content from another node of a computer network (step 302). Content 
means information in any form. Content is provided in messages (e.g., a page, streaming audio, or 
streaming video) or objects (e.g. a file, or a downloaded executable program, applet, or script). 
Content may refer to a home page of the Internet, as discussed above. The request may be in 
accordance with any conventional protocol, including by providing a URL as discussed above at step 
246. 

Derivative content is prepared (step 304) based on at least a portion of the so-called 
primary content requested and received in the previous step. Derivative content may be in stand alone 
(e.g., a single message or file) or streaming format (a series or set of messages or files). Derivative 
content may take the form of a page, as discussed above (e.g., PAGE2). Guidance for preparing the 
derivative content may be integral to (e.g., like conditional in-line styles), referenced by (e.g., like 
external style sheet), or separate from the primary content. In one implementation, the primary 
content makes no reference to guidance. For example, as described above, stand alone derivative 
content (e.g., PAGE2 or PAGE4) may be prepared based on primary content (e.g., PAGEl or 
PAGE3) in accordance with guidance (e.g., database 230) that is separate from and not referred to 
from the primary content. 

The derivative content may be presented while allowing the interjection of a 
command (step 306). The command may originate from a user to which the presentation was being 
directed, from a user monitoring the presentation for someone else, or from an automatic process 
(e.g., automatically sequencing a device used for advertising, warning, entertaining, or teaching based 
on external criteria such as number of passers by, number in attendance, attrition in attendance, or 
time of day). Interjected commands may be initiated by operation of a keypad (e.g., of 1 13 or 1 14), a 
keyboard (e.g., of 1 1 1 or 1 12), or speech (e.g., at 1 1 1, 1 12, 1 13, or 1 14), for example, as discussed 
above. In one implementation more than one access device is simultaneously in use by the same user. 
For example, a monitor of workstation 111 may receive a presentation for use at the same time by the 
same user of a device 1 12, 1 13, or 1 14. Commands may be of the type described in Table 2. 
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TABLE 2 



Command Description 


GUI Action 
Liput to a 
Visual 
Browser 


Speech hiput to a Voice 
Browser 


Keypad Input to a Voice 
Browser 


Present information from 
the immediately 
preceding page. 


Back button 


"Go back" 


00 


Present information from 
the top of the current 
page. 


Pull vertical 
scroll bar to 
top. 


"Go to top" 


0 1 


Follow the current link. 


Click on 
desired link. 


"This Imk" 


02 


Follow the previous link. 


Click on 
desired link. 


"Prior link" 


03 


Skip forward in the text- 
to-speech queue 2 
sentences. 


Space bar. 


"Fast forward" 


04 


Skip backward in the 
text-to-speech queue 2 
sentences. 


Shift-space 
bar. 


"Rewind" 


05 


Accept a new page 
address. 


Click in 
address box. 


"New URL" 


06 


Speak to the voice 
system operator. 






07 


Continue the presentation 
after speaking to the 
operator. 




"Contmue" 


08 


Mute the microphone 
and take only keypad 
commands. 






09 


Present information form 
the current page again. 


Refresh button 


"Repeat" 


* 
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15 



20 



Command Description 


GUI Action 
Input to a 
Visual 
Browser 


Speech Input to a Voice 
Browser 


Keypad Input to a Voice 
Browser 


Provide operating 
instructions. 


Help button 


"Help" 


0 


Present a list of available 
links. 




"Give me the links" 


# 


Identify a link. 


Underscored 
words 


Repeat exactly what the 
speech engine said when the 
link was presented; or say the 
name of the link number 
(e.g., "three"). 


1 to99 


Accept text entered by 
the user. 


Keyboard 
input 


Say the name of each letter to 
spell out the text desired to be 
input (e.g., to enter "cod" say 
"see oh dee"). 


Press a two digit 
designation for each 
letter. The first digit is 
the three letter group. 
The second digit is the 
position in the group. 
For example, "cod" 
would be 23 63 31 
because the groups are 2 
for abc, 3 for def, etc. 



When the presentation has been completed (audio or visual), and no user command 
has interrupted the presentation, further steps of the method may be delayed (step 308) for a suitable 
time to give the user opportunity to give any suitable command as discussed above. Upon lapse of the 
delay or on entry of a command, the command (or a default command) may be executed (step 310) 
within the context of the current presentation (e.g., "Go back" is relative to the current presentation). 
After command execution, the method is repeated beginning with step 302. 

Preparation of derivative content may be accomplished according to various aspects 
of the present invention according to a method comprising one or more of the steps of: obtaining 
content for presentation in accordance with guidance, locating a node in accordance with the 
guidance, and reducing the complexity of content to be presented. Content and guidance may be 
represented by respective trees (e.g., a content tree, a guidance tree). A description of content to be 
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presented may be prepared from which the presentation is made. Such a description may be included 
in the aforementioned content tree or stored in a separate storage area. A separate storage area may be 
used for a list, array, or tree of references, where each reference may be a pointer to a node of a 
content tree. An output tree may be prepared as a copy of various portions of content, guidance, and 
5 variables. An output tree may be traversed to prepare derivative content. 

For example, step 304 of method 300 may be accomplished by the cooperation of 
several processes illustrated by the data flow diagram of FIG. 4 to produce derivative content 440. 
Process 304 receives as inputs primary content 400 (e.g., one or more files, buffers, or messages, as 
discussed above) and an address of the primary content (PC), for example, a URL (PC URL). Process 
10 304 includes find guidance process 402, find model skeleton process 404, analyze primary content 
process 406, reduce complexity process 408, make primary content skeleton process 410, align 
skeletons process 412, get next references process 422, find node process 424, get node by skeletal 
S position process 430, get node by node name process 428, get node by content match process 432, and 

annotate process 436. Each process 402-436 may be performed by transcoder proxy server 125 from 
Q 5 time to time (e.g., in serial or in parallel) at any time data for that process is suitably available. 
F Find guidance process 402 receives the PC URL, obtains suitable guidance associated 

with at least a portion of the PC URL, or obtains default guidance when, for example, suitable 
guidance has not been associated with the PC URL. Preferably, guidance is obtained in a markup 
ff: language, for example, the markup language herein called MASK. Guidance so found is provided to 

H20 get next references process 422 and to annotate process 436. 

Find model skeleton process 404, receives the PC URL, obtains a suitable description 
^ of model content associated with at least a portion of the PC URL, and provides the description to 

align process 412 and to get node by skeletal position process 430. The description is preferably 
stored in the form of a tagged skeleton of the MASK markup language. The skeleton may be included 
25 with the guidance for convenience of access. In such an implementation, find model skeleton process 
404 may be omitted and find guidance process 402 may provide the skeleton as needed by other 
processes. 

When found, the description of model content may be provided as records 415 (e.g., a 
representation of the model content as a tree having nodes, or a description of such a tree). The 
30 description of model content includes an association between a descriptive identifier and a node 
identifier for each node of the model content. For example, records 415 provided by find model 
skeleton process 404 may include for each model node: a single printable character as the descriptive 
identifier (MODEL LETTER) and a printable hyphenated digit string as the node identifier (MODEL 
NODE). 

35 The method used for finding guidance need not be identical to the method for finding 
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a suitable description of model content. In one implementation, guidance includes a reference to a 
suitable description of model content; facilitating a many to many relationship between records (or 
files) containing guidance and records (or files) containing descriptions of model content. Access to 
either guidance or a description of model content may in addition be based on information from a user 
5 account including for example the type of device (or preferences established by the user for a device 
having a particular identification) used in step 302 for which the derivative content is to be prepared. 

Analyze primary content process 406 reads primary content 400 and prepares records 
418 for use by make PC skeleton process 410 and get node by node name process 428; records 419 
for use by get node by content match process 432; and records 420 for use by find node process 424 
10 and reduce complexity process 408. Records 418, 419, and 420 may be organized in any 

conventional manner, preferably as a tree. Any implementation of a tree representation may be used, 
including, for example, a database of the type known as a document object model (DOM). 
2 Descriptions of a DOM are available from W3C, for example, at Internet web site 

<http://www.w3c.org/TR/REC-DOM-Level-l>. The DOM may be stored as a conventional database 
wJ5 (e.g., relational or star). Information in the form of a DOM may exist in memory in any suitable 
=r storage format including array, linked list, graph, tree structure, data structures, combinations and 

:i: equivalents. Typically, a DOM includes a data structure that represents a tree. 

^ A tree is a type of graph having nodes, branches, and leaves. A leaf (e.g., an element 

0 : of content) may be represented as a data structure comprising any object expected to be included in 

{1-^0 content (e.g., text, table, bitmap, applet, link, etc.). Nodes and branches may be represented as a data 
structure comprising a list (e.g., pointers to elements or other lists). One node is typically designated 
the root for accessing any leaf Access to particular content (e.g., a leaf) may be accomplished with 
reference to a node name (e.g., "0-4-3") by beginning at the root ("0") and following a pointer 
(traversing a branch between nodes) for each portion of the node name (e.g., following the 4th pointer 
25 of the list at node "0" then following the 3rd pointer of the list at node "0-4"). 

Analyze process 406 determines, for each node of primary content, an association 
between a node name (e.g., PC NODE, "0-4-3") and a portion of primary content (e.g., a table). 
Analysis may proceed according to the nest level of elements (e.g., tags, attributes, and enclosures) in 
the markup language of primary content 400. These associations may be stored as records 418. Each 
30 record 418 may include a copy of the associated content from primary content 400. Analyze process 
406 determines, for each node of primary content containing text, an association between a node name 
(e.g., "0-4-3") and every word (or other meaningfiil symbol) used in the text at that node. These 
associations may be stored as records 419. Records 419 may be stored in a form of the type known as 
a hash table. Each record of the hash table may have a hash key for a particular word (or meaningful 
35 symbol) in primary content as a whole and a list of node names for each node having content 
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comprising the particular word (or meaningful symbol). Analyze process 406 determines, for each 
node of primary content, a set of numeric descriptions (herein called node statistics) and forms an 
association between a node name (e.g., "0-4-3") and the set or members of the set. These associations 
may be stored as records 420. Node statistics may include any quantitative properties, for example, as 

5 described below with reference to <nodestat />. 

If find guidance process 402 does not find suitable or default guidance, preparation of 
derivative content 440 proceeds on the basis of records 418 and 419. For each node of records 418 
(e.g., in depth first order from root), reduce complexity process prepares derivative content as one or 
more nodes of derivative content. Derivative content 440 is preferably provided in a markup 

10 language. A copy of the primary content associated with the node in records 418 may be identified as 
derivative content or copied to a file, buffer, or message as derivative content. If content at a primary 
content node is too complex, additional or substitute content for one or more nodes of derivative 
content may be prepared. 

± If find guidance process 402 did locate suitable guidance, preparation of derivative 

Ij 5 content 440 in accordance with the guidance may produce a result that is still too complex for 
I: presentation. Preparation of derivative content 440 then proceeds when records 418 are consistent 

^ t with such guidance (e.g., when so indicated by annotate process 436). Reduce complexity process 

408 reads records 418 and may produce derivative content 440 as discussed above. 
(jf In either case, reduce complexity process 408 determines complexity of content at a 

:}:20 node with reference to quantitative properties of the node. For example, quantitative properties 
discussed above with reference to records 420 may be read by reduce complexity process 408. 
Reduce complexity process 408 may compare such quantitative properties to predetermined threshold 
values (e.g., limits). Derivative content, provided by reduce complexity process 440, primarily 
includes content that has been tested or prepared and determined to not exceed suitable limits. 
25 According to various aspects of the present invention, a comparison is made in 

accordance with primary content and model content to facilitate, inter alia, identification of derivative 
content. This comparison may be made between a description of the primary content and a 
description of the model content. Preferably, the comparison is made between a skeleton of the 
primary content and a skeleton of the model content. Make primary content skeleton process 410 
30 prepares a description of primary content in a format suitable for comparison. For example, find 
model skeleton process 404 provides records 415 as a skeleton as discussed above; and, make PC 
skeleton process 410 provides records 416 comprising corresponding information determined from 
records 418. The description of model content includes an association between a descriptive identifier 
and a node identifier for each node of the model content. Records 416 provided by make PC skeleton 
35 process 410 may include for each primary content node: a single printable character as the descriptive 
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identifier (PC LETTER) and a printable hyphenated digit string as the node identifier (PC NODE), 

Guidance is prepared with reference to nodes of the model content. To obtain 
corresponding nodes of primary content, a correlation is made between records 415 and 416. This 
correlation may be accomplished using records 415 as a whole against records 416 as a whole. For 
5 example, in an implementation having a single character for each descriptive identifier, the descriptive 
identifiers for the model may be aligned to the descriptive identifiers of the primary content as a 
correlation between character strings. Align skeletons process 412 generally associates a PC node to 
each model node. A conventional process may be used which accounts for mismatches such as (a) 
PC skeleton having nodes not found in model skeleton; (b) model skeleton having nodes not found in 

10 PC skeleton; (c) model nodes not found in the same sequence as provided in PC skeleton; (d) PC 

skeleton having a node (and all subordinate nodes) recognizable as corresponding to a node of model 
skeleton. Recognition of correspondence between nodes may be accomplished by conventional 
search and alignment processes, for example, of the type known as Smith- Waterman. Align process 
412 provides an association between model node identifiers and PC node identifiers. This association 

15 may be provided as records 417, each record comprising a model node identifier (e.g., "0-2-6") and a 
primary content node identifier (e.g., "0-4-3"). 

The associations discussed above with reference to records 415-420 may be stored 
and accessed in any one or more conventional manners (combined or separate), including in-memory 
tables, arrays, and linked lists; or on secondary storage as files or a database. 

20 Guidance may specify derived content by making one or more references to a node of 

primary content. When guidance is provided without a description of model content (e.g., without a 
skeleton of model content), references may include words (or meaningful symbols) suitable for 
identifying a node of primary content. When guidance is provided with a description of model 
content, references may include node identifiers (e.g,, "0-4-3") and positions relative to the 

25 description of model content (e.g., a skeletal position (SP)). In other words, for each node of 

derivative content to be prepared, guidance may include one or more references for obtaining content 
of a node of primary content to be included as a node of derivative content. For each node of 
derivative content, get next references process 422 provides to find node process 424 a set of 
references. Preferably, the set includes several references so that (a) if use of one reference does not 

30 identify a node of primary content, an alternate reference may be used successfully; and (b) if a 

reference identifies more than one node of primary content or the set of references identifies more 
than one node of primary content, a judgment can be made to resolve ambiguity. When find node 
process 424 has accomplished specification of the next node of derivative content, find node process 
424 may cooperate with get next references process 422 in any conventional manner to obtain each set 

35 of references until all sets have been processed. 
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Find node process 424, for each set of references received from get next references 
process 422 identifies a node of primary content to be included in derived content. In one 
implementation, such identification may be made by associating a flag with a record of records 418. 
In an alternate implementation, the flag may specify a node number of the derived content so that the 

5 organization (e.g., sequence or tree structure) of derivative content may differ from the organization 
of primary content. One or more references provided by get next references process 422 may be 
provided in a conditional construction to facilitate if-then-else or switch (e.g., case) action by find 
node process 424. For example, a conditional construction may permit find node process 424 to use a 
first set of references if the condition is true and use a second reference (e.g., a default), if the 

10 condition fails. Conditional logic may make reference to node statistics 420, described above. Find 
node process 424 obtains a node identifier for each reference and resolves ambiguity among node 
identifiers to determine whether a node is found and if so which node (if several could apply) will be 
designated for inclusion in derivative content. An implementation for resolving ambiguity will be 
discussed below after processes 428-432. 

15 Find node process 424 may receive from get next references process 422 a reference 

to a node of the model. The reference may be a node identifier in any conventional form, including a 
form of the type "0-3-2"). Get node by node name process 428 receives such a node identifier and 
obtains the associated PC node by query (or look up) on records 418. In other words, if the cxirrent 
primary content has a structure that includes a node named as in the model content, then the primary 

20 content node is provided by the get node by node name process 428 to the find node process 424. If 
no node by that identifier exists in the primary content, an indication of that result is returned instead. 
If the set of references provided by get next references process 422 includes more than one such node 
identifier, get node by node name process 428 performs corresponding queries (or look ups) and 
returns respective results. 

25 Find node process 424 may receive a reference to a portion of the model content 

description from get next references process 422. For example when the description of model content 
is a skeleton (e.g., a character string) as discussed above, the reference to a portion (e.g., a substring) 
of the skeleton may include a starting character position and an ending character position. Get node 
by skeletal position process 430 attempts to identify a corresponding portion of the description of 

30 primary content. For example, a query or lookup on records 415 may provide a model node identifier 
associated with the starting character position and another associated with the ending character 
position. A query or lookup on records 416 may provide a PC node identifier for each model node 
identifier. Get node by skeletal position process 430 then returns the resulting one or more PC node 
identifiers. If the PC node identifiers comprise a tree, the root of the tree may be returned. If no PC 

35 node identifier corresponds to the skeletal positions, an indication of that result is returned instead. If 
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the set of references provided by get next references process 422 includes more than one such skeletal 
position, get node by skeletal position process 430 performs corresponding queries (or look ups) and 
returns respective results. 

Find node process 424 may receive a reference as a word, symbol, or phrase (e.g., a 
character string) from get next references process 422. For example, content may be identifiable by 
the title of a table as in "Markets Snapshot". Get node by content match process 432 attempts to 
identify a corresponding PC node matching the word, symbol, or phrase. Get node by content match 
process 432 may form the hashed equivalent of all or a portion of the reference and perform a query 
(or lookup) on records 419 to obtain a PC node identifier. If no PC node identifier contains the 
referenced content, an indication of that result is returned instead. If the set of references provided by 
get next references process 422 includes more than one such reference to content, get node by content 
match process 432 performs corresponding queries (or look ups) and returns respective results. 

As directed by guidance, find node process 424 may resolve relative referencing as to 
any of the returned PC node identifiers. For example, the guidance may indicate that a parent (of the 
node according to the reference) is intended (e.g., by syntax such as "<-" discussed below). Find node 
process 424 may receive zero or more PC node identifiers from processes 428-432 as discussed 
above. If no PC node identifier is returned, find node process 424 may produce no node information 
to annotate process 436 and proceed to acquire the next references fi-om get next references process 
422. If one PC node identifier is returned, then find node process 424 may present that PC node 
identifier to annotate process 436. When one PC node identifier corresponding to a model node (M) 
has been found and more than one PC node identifier is returned fi-om processes 430 and 432, 
ambiguity may be resolved by the following method: 

1. Consider that each reference produces a corresponding list of PC node identifiers; 

2. Form a list of candidate PC node identifiers (Ci..k) where each candidate (e.g., C,) appears on all 
lists; 

3. Compute a score for each candidate and choose the PC node identifier having the lowest score. 
Each score may be computed as follows: 



x=mm(Q,P) 

Sc. = ^ - Mf + {\Q-P\ X 10'} 

Where: Cn is a candidate having a score Sn. 

Each C has P levels in its node name (e.g., a node name of "0-0-3-5" has 4 
dimensions, one dimension for each of 4 levels). 
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M is the model node name having Q levels in its node name. 
And, "minQ" returns the minimum of its arguments. 

The score represents a "distance" from the model node to the candidate node. The 
5 distance is calculated using the square root of the sum of squares of dimensional differences. When 
the candidate node identifier and the model node identifier are of different levels, an arbitrary 
dimensional difference of 1 0 is assigned to each level not appearing in both the candidate and the 
model node identifiers. In other implementations, an arbitrary dimensional difference in the range of 
2 to 20 is used. 

10 When one PC node identifier has been determined by find node process 424 (the 

"found node"), notice of the found node is communicated to annotate process 436. Annotate process 
436 may provide revised content to reduce complexity process 408 for use as derived content. The 
revision applied by annotate process 436 may include summarization, selection, annotation, and/or 
restatement of the primary content associated with the found node. The revision may be 

1 5 accomplished according to any conventional editing process suitable for the content being revised. 
For example, additional text or audio may be added to text content. 

Guidance may direct how complexity and importance are determined and 
consequently how derivative content is prepared. In other words, guidance may identify particular 
measures and limits that are to be used for determining complexity; and, may develop the derived 

20 content based on knowledge of how the average user would react to the content — what information 
may be considered most or least important, what information should be spelled instead of spoken for 
clear understanding and ease of navigation, etc. 

A find guidance process, according to various aspects of the present invention, 
includes any process that accesses guidance for use by a transcoding process. For example, find 

25 guidance process 402 may include the method of FIG. 5A for determining whether to apply default 
guidance. In such a method, default guidance is used when no other guidance is available as 
associated with the primary content. Such default guidance may include indicia of threshold 
conditions or limits used for analyzing content (e.g., measures and limits relative to complexity of 
content, measures and limits relative to expected screen size, characteristics to be tested to recognize 

30 the organization of the content). Default guidance may be accessed in accordance with the results of 
such analysis. 

Identifying suitable guidance may be accomplished according to various aspects of 
the present invention according to a method comprising, one or more of the steps of accessing 
guidance based on an address of the primary content, an address related to the address of the primary 
35 content, an address that may be within the scope of a regular expression, and correlating at least a part 
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of the primary content with content associated with guidance so as to identify such guidance as 
suitable for a region of the primary content. 

In one implementation, guidance is associated with the address of the primary content 
(e.g., the URL of the primary content). Using the address as criteria, a query may be made of a 

5 database that provides one or more records (e.g., a file) having indicia of guidance. 

In another implementation a query is formed by substituting a regular expression for 
part of the address of the primary content. As an example adapted for use with the Internet, content 
may be addressed by a URL that may include a date. By including a date, an archive of content may 
be maintained and accessible by navigation. Web sites that provide news may offer new content on a 

10 daily basis, addressing each page with a date code in the form of YYMMDD for year month and day. 
To avoid having to prepare guidance that is similarly addressed with a date code, guidance that may 
apply to content regardless of date code may be obtained from a database having a query that omits 
the date code, or restates the date code as a regular expression. In other words, when a user demands 
content addressed with a date code, an address suitable for query may be derived from the demanded 

1 5 address by substituting a regular expression for the date code. According to various aspects of the 
present invention, any portion of the address of primary content may be replaced with a regular 
expression. Several regular expression substitutions may be made to form a single address suitable 
for query. Exemplary regular expressions are described in Table 3. Many regular expressions of 
varying syntax are known. So-called wildcard characters constitute a simple type of regular 

20 expression (e.g., and in the filename syntax supported by MSDOS marketed by Microsoft). 
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TABLES 


Example Regular 


Description 


Expression Symbol 




[.]* 


Brackets "[]" enclose a character class definition. Asterisk specifies 
occurrences must be zero or more. Occurrence specifications may be "+", 
"{n,m}" as discussed below. The period describes the character class 
representing all characters except newline. Thus "[.]*" represents 0 or more 
characters not including the newline character. 
Example: usa.[.]* 

Matches: "usa.com", "usa.org", and "usa.nef . 


[\w]+ 


"\w" describes the character class of letters, digits, and underscore Plus 

specifies occurrences must be one or more. 
Example: us[\w]+.com 



26 



46806.00007 



Example Regular 


Description 


Expression Symbol 




= : 


Matches: "usa.com" and "ussn.com". 


[\d]{n,m} 


"\d" describes the character class of digits. "{n,m}" specifies occurrences must 




be at least "n" and not more than "m". Thus, "[\d] {2,4}" represents at least 2 




but not more than 4 digits. 




Example: 5/31/[\d]{2,4} 




Matches: "5/3 1/98" and "5/3 1/2000" 



1^ Content available via a computer network is subject to change v^ithout notice when, 

for example, different computers coupled to the network are administered by independent authorities 
for different purposes. Content changes that are understandable when viewed on a workstation 
monitor (e.g., information is presented in a visually reorganized manner as to relative positions and 
emphasis of information) may, when analyzed from the source markup language, appear as structural 
and/or aesthetic changes. To assure that a consistent presentation is made to an audio device or a 
limited display device, predefined guidance that was suitable for content prior to a change is, 
according to various aspects of the present invention, identified as suitable for similar content 
following the change. Consequently, predefined guidance need not be revised to remain suitable for 
developing derived content from primary content that has changed somewhat. 

In a preferred implementation, guidance is created from model content and stored and 
accessed using directory access structures and methods of the type known as lightweight directory 
access protocol (LDAP), described in RFC 2251 and 2252 with reference to ITU-T X,680 "Abstract 
Syntax Notation One ASN.l.". Guidance as accessed using LDAP is stored as an entry in a directory 
information tree. Each entry corresponds to a node of the tree. Attributes may be associated with a 
node. Attributes include, for example: 

1 . NAME — An identifier of the guidance entry at a node, e.g., a filename. 

2. DESCRIPTION — A user-supplied description of the guidance. For example, a user who has 
prepared guidance using an editor of the type described below with reference to FIGs. 8-10 may 
describe the guidance in his or her own words for future reference, 

3. OBJECT— The guidance itself. May be text as specified in a markup language, e.g,, the 
MASK markup language. 

4. USER ID ~ An identifier associated with a user (e.g,, during a registration process) and 
assumed to correspond to the same user during any subsequent session. By associating a USER_ID 
with particular guidance, different users may enjoy different derivative content fi-om the same primary 
content. Each user may have defined separate guidance using an edit process described below with 
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reference to FIGs. 8-10. A value of USER ID may correspond to any group of users (e.g., all users). 

5. MODEL_ADDRESS - The complete address of model content used in the preparation of the 
guidance TEXT. For example, a complete URL. 

6. APPLIES_TO_ADDRESS An address to which the guidance OBJECT should be applied. 
5 For example, an address used in step 502, such as a primary address or an address comprising one or 

more regular expressions. For example, the user having created the guidance OBJECT from model 
content, can also specify the APPLIES TO ADDRESS as the same as the model content URL or as a 
URL comprising one or more regular expressions for a wider application of the guidance OBJECT. 

7. HAS_REGULAR__EXPRESSION - A flag indicating that the APPLIES_TO_ADDRESS 
10 includes at least one regular expression. 

8. IS FIRST — A flag indicating that the derivative content determined from the guidance 
OBJECT should be provided to the user interface first. In response to an access request using LDAP, 
a list of entries may be provided. Only one entry of such a list is permitted to have a set IS FIRST 
flag to indicate that derivative content determined from guidance OBJECT of that entry is to be 

15 presented before derivative content as determined from guidance OBJECT at other entries of the list. 

Use of the directory information tree and access protocol discussed above facilitates 
obtaining guidance for the preparation of derivative content as a set of pages. A search for suitable 
guidance conducted by find guidance process 402 (step 502) may return a plurality of entries (i.e., 
nodes) each having a suitable APPLIES_TO_ADDRESS attribute. Such a search may seek entries in 

20 accordance with one or more target attribute values. In one implementation, a list of entries returned 
from an LDAP search may be further analyzed by a script (e.g., regular expressions may be evaluated 
in the PERL programming language). For example, the primary address may match the respective 
APPLIES TO ADDRESS attributes of several entries exactly; or, the primary address may be within 
the scope of the regular expression of one or more APPLIES_TO_ADDRESS attributes. When 

25 results of a search identify more than one entry, entries may be processed in any order (step 503). A 
return to the calling process may be made after step 502. Preferably, an entry has been identified as a 
first entry, for example, as by flag IS FIRST, discussed above. When primary content is to be 
presented as a series of pages (e.g., an aggregation as discussed below), the first entry may correspond 
to the first page of the set. When primary content is to be presented as a hierarchical set (e.g., a 

30 summarization as discussed below), the first entry may correspond to a first page having a table of 
contents, each line of the table comprising a link to a subordinate page of the hierarchy. A return to 
the calling process may be made after step 503. 

After determining a suitable first entry to process, guidance corresponding to 
identified entries may be used to prepare derivative content in a repetitive or recursive manner until 

35 derivative content has been prepared for all identified entries. 
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According to various aspects of the present invention, guidance prepared with respect 
to model content may apply for currently requested primary content (step 302). A process for 
determining whether to apply such guidance to primary content may include determining whether a 
sufficient correlation exists between a description of the model and a description of the primary 
5 content. A description may include a subset, a summary, or a restatement of all or a portion of the 

content being described. The restatement may be in a markup language. The markup language used 
for the description may differ from a markup language used to express the content being described. Li 
a preferred implementation, the description of the model content and the description of the primary 
content are prepared in a tagged element of a markup language. 
1 0 The description of content for, inter alia^ ascertaining suitable guidance, is captured, 

maintained, and used for analysis in what is referred to herein as a skeleton (e.g., identified in the MASK 
language as enclosed in the tags <sklt> and </sklt>). A skeletal description, according to various aspects 
ijo of the present invention, emphasizes the structural aspects of content (e.g., hierarchy of nodes and nesting 

' Jf of tables) and de-emphasizes the information conveyed by the content (e.g., particulars of a news story or 

Mi. 15 stock price). Consequently, correlation of skeletal descriptions is efficiently accomplished by any 
t„' conventional technique of string comparison and alignment, as discussed above with reference to process 

|y 412. A partial list of elements of a skeletal description are described in Table 4, adapted for description 

J;^.^ of content expressed in the HTML markup language. By using a single character (e.g., one byte) for a 

gl structural feature (e.g., the beginning of a table definition), a compact description results for efficient 

JJf 20 generation, maintenance, and analysis. 

Make PC skeleton process 410 may refer to statistics 420 for each node to determine a 
suitable description for the respective node. When a node includes a variety of features (e.g., a mix of 
text, digits, and links), one description may be chosen to correspond to the predominant feature. The 
descriptions below may be understood to refer to the predominant feature, for example, corresponds 

25 

to a node having predominantly links, though text and digits may also be present. 

TABLE 4 



35 



Skeleton 
Symbol 


Corresponding Feature Of 
Content In Markup 
Language 


Description 


Handh 


<html> and </html> 


Encloses content written in HTML. 


Bandb 


<body> and </body> 


Encloses the body of an HTML document 


Tandt 


<table> and </table> 


Encloses a table. 


Randr 


<tr> and <J\x> 


Encloses a table row. 
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10 



15 



20 



Skeleton 
Symbol 


Corresponding Feature Of 
Content In Markup 
Language 


Description 


Dandd 


<td> and <td> 


Encloses a table cell 


Fandf 


<form> and <Vform> 


Encloses a form. 


I andi 


<mput> 


Defines an input element such as a radio button or text 
box. 


P andp 


<p> and </p> 


Encloses a paragraph. 


U andu 


<ul> and </ul> 


Encloses an unordered list. 


O ando 


<oI>and<ol> 


Encloses an ordered list 


Landl 


<li> and <J\i> 


Encloses a list item. 


V and V 


<div> and </di\> 


Encloses division within an HTML document. 


Yandy 


<layer> and </layer> 


Encloses tfie defmition of a layer. 


* 


text 


Represents a passage of text of 300 words or more. 


$ 


text 


Represents a passage of text of 6 to 299 words. 


% 


text 


Represents a passage of text less than 6 words. 


# 


digits 


Represents any number of digits forming a numeric 
value or a date in numeric format. 


@ 


link 


Represents a link comprising 6 or more words 




link 


Represents a link comprising less than 6 words. 



As discussed above, preliminary to ascertaining a correlation between model content 
and current primary content, a description of model content may be obtained (step 504) as associated 
with an address (e.g., steps 502 and 503). The description of model content may be a tagged element 
in the OBJECT attribute which comprises text in a markup language. A description of primary 
content (or a region of primary content) is prepared (step 504) as discussed above with reference to 
process 410. 

Conventional string alignment and comparison may be used to determine an extent of 
correlation (step 508) between the skeletal description of the current content (e.g., retained in a 
temporary memory) and the skeletal description of the prior content (e.g., retrieved from the <sklt> ... 
</skl1> tagged section of guidance in the MASK markup language), A conventional Smith- Waterman 
process may be used. 

In an alternate implementation, a historical model of the type known as a hidden 
Markov model may be used. The historical model may reflect the probability of particular content 
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(e.g., a sequence of structural features) appearing at a particular position in the skeletal description, at 
a particular screen location, or in logical or positional association with other particular content. The 
historical model may be developed with reference to one or, more preferably, many examples of 
primary content. To determine whether sufficient correlation exists between current primary content 
5 and the historical model, a second model of the current primary content may be prepared and the 
historical and second models may be compared in any conventional manner. 

If the extent of correlation is sufficient, the guidance associated with the description 
of prior content is returned as suitable guidance (step 510 and 514). Otherwise default guidance may 
be returned (steps 510 and 5 12). In determining whether the correlation is sufficient, a logical or 

10 numeric threshold and comparison may be used. For example, if it is sufficient to find a match, then 
the binary logical assertion that a match has been found determines the result of the test (step 510). In 
an alternate implementation, a weighted sum of the extents of matches found (e.g., allowing for 
intermittent mismatches to also be present in the correlation) may be compared to a threshold numeric 
value. When the weighted sum exceeds the value, for example, sufficient correlation may be 

15 concluded. In another implementation, probabilities derived from a model are combined in any 
conventional manner to conclude sufficiency. For example, if matches are found in portions 
associated with a low probability of change or mismatches are associated mostly with portions that 
have a high probability of change, sufficient correlation may be concluded. 

A reduce complexity process according to various aspects of the present invention 

20 includes any process that reorganizes content for better navigation. Because content to be presented is 
expected to be too complex to present in audio or on a limited display, the content is reviewed to 
recognize its organization. Complex content is content that is difficult for an average user to 
understand or navigate when presented in audio; and content that is difficult for an average user to 
understand or navigate when presented in part on a screen of limited display area and/or resolution. 

25 Navigation includes the process of understanding the presentation, recognizing links, selecting a 
desirable link, and commanding access to the information associated with the desired link. 
Complexity may interfere with one or more of these steps of navigation. 

If the organization permits recognition of regions within the content, further analysis 
is accomplished for each region more or less independently. A region is a portion of the content 

30 having few contextual associations to other portions of the content. Contextual associations may be 
structural or aesthetic. Structural differences may include differences in presentation, for example, 
text presented in a first table may be fairly dissociated fi-om text not in the table, text placed at a 
distance from the table, and text presented in a second table. Aesthetic differences may include 
differences in appearance, physical location, or arrangement, for example, when the content (e.g., 

35 expressed in a markup language) specifies a first passage of text and links on a first backgroimd and a 
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second passage of text and links on a second background, the association between the first and second 
passages of text may be treated as weak by implication and a different region may be defined for each 
passage. Many pages have headline information at the top and legal notices and general information 
presented at the bottom of a large screen. Blank space may indicate a separation of portions. Relative 
5 physical location may indicate importance. Having recognized differences defining regions, and 

having recognized probable importance that may be associated with each region, further analysis may 
be accomplished for each region in rank order of relative importance. Based on the above criteria, a 
region for further analysis is selected (step 524). 

Analysis of content to be presented is accomplished to ascertain whether the selected 

10 region is amenable to further subclassification (e.g., forming a hierarchical set of pages representing 
the content of the region). For example, the complexity of the region may be determined (step 526) 
by calculating one or more measures of complexity. When the content is expressed in a markup 
language, measures may include number of bytes in the markup language to express the content of the 
region, number of bytes of content in the region, and number of links in the region. 

15 If the region is not determined to be too complex (step 528), identifiable portions of 

the region may be analyzed for relative importance (step 530) and derivative content 440 may be 
constructed. Derivative content may include content in a markup language not necessarily the same 
as the markup language of primary content 400. The derivative content may include the portions of 
the region positioned (e.g., absolutely from screen top right comer, relatively to any desired position 

20 of the expected screen, or relatively to other portions of the region), highlighted, or sized, in 
accordance with the rank order of importance (step 532). 

On the other hand, when the region is determined to be too complex (e.g., when one 
or more measures or weighted measures exceed a limit) a set of divisional members of derivative 
content may be prepared (step 534), each having a part of the selected region's content. The rank of 

25 importance of each divisional member may be ascertained (step 536) in a manner as discussed with 
reference to step 530. And, derivative content may be prepared as including a list of links to each 
divisional member of the set. Any suitable summarizations and annotations may accompany the links 
in the derivative content. Some content identified for presentation (e.g., by annotate process 436 or 
by primary content 400) may be omitted from the set. 

30 Having developed derivative content in a preliminary form (steps 532 and 538), any 

conventional style sheet (or cascading style sheets) may be applied (step 540) to affect font, position, 
background, and the numerous other content properties conventionally controllable via style sheets. 
The derived content may be revised (step 542) for in-line styles or references may be made to style 
sheets in files or messages maintained separate from the derived content. 

35 Derivative content for all divisional members may be prepared while the primary 
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content is available and being analyzed. Alternatively, when a set of divisional members has been 
identified to be prepared (step 534), the first member (or summary of the set) may be prepared in step 
542, and the remaining members may be prepared w^hen demanded (e.g., a link is followed). When 
presenting a member of a set, a conventional cookie may be sent and or updated to simplify 
5 navigation among members of the set or to indicate that preparation of derivative content of another 
member is desired. Derivative content for all regions may be prepared by repeating steps 524-544 for 
all pages and all regions (step 544). 

After step 544, control may return to the calling process. 

In a preferred implementation, annotated content is stored in a tree (e.g., a DOM) as 
10 discussed with reference to analyze primary content process 406 and records 418. Records 418 are 
retained in memory as the output tree. Reduce complexity process 408 receives notice of a PC node 
identifier that is ready for further processing from annotate process 436. The received PC node 
identifier identifies a node (corresponding to a region in step 524) for complexity determination. 
Children of the identified node correspond to content portions (step 530) or member pages of a set 
15 (step 534). Steps 524-538 may be accomplished by applying to the identified node a set of rules. 
Rules may be applied in any order or organized as nested if-then-else (or case) statements to be 
applied in a fixed order. Actions that may be taken according to a preferred set of rules are described 
in Table 5. 

In Table 5, a dominant node may be defined as a node having a large percentage of 
20 the content of the primary content. The percentage may be determined using statistics (e.g., number 
of b>tes) applied to this node divided by statistics (e.g., number of bytes) applied to the top node of 
the tree. For example, a form may be dominant when the number of bytes of text content to be 
presented for the tag pair of this node (e.g., from <form> to </form>) divided by the number of bytes 
of text content to be presented in the page as a whole (e.g., from <html> to <yhtml>). If the ratio is 
25 greater than 2/3 (i.e., 67%), the node may be considered dominant. 



TABLE 5 



30 



Predicate Action 


Description 


Conditions Precedent 


DISCARD 


Do not present anything 
from this node 


(a) when this node is substantially not text (e.g., bit 
map data, audio, etc.); (b) when this node has been 
flagged as removed (e.g., by operation of button 916); 


USE AS IS 


Present node contents 
without further analysis 


(a) when this node corresponds to an HTML <head> 
tag; (b) when this node is a dominant HTML <form> 
section; 
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Predicate Action 



Description 



Conditions Precedent 



AGGREGATE 



10 



Present node content in 
part and indicate with a 
link that further content 
is available for 
presentation (e.g., a link 
to "more"). 



(a) when this node and its parent are both 
substantially text, and this node has few children, and 
this node is not designated first (e.g., flag IS FIRST 
is not set), and this node's content is amenable to 
presentation on one display screen or in one audio 
recitation; (b) when summarization has or will effect 
more than a maximxmi number of links, the remaining 
links are aggregated; 



SUMMARIZE 



15 



l; 20 



25 



Present a table of 
contents, each line 
comprising a link to a 
portion of the node 
content. 



(a) when this node is an HTML <form> section, but is 
not dominant; (b) when this node and its parent are 
both substantially text; and this node is not designated 
first (e.g., flag IS FIRST is not set); and this node has 
few children, and this node's content is amenable to 
presentation on one display screen or in one audio 
recitation; (c) when this node's p^ent is not 
substantially text, and this node is substantially text, 
and this node is not a dominant node, and this node is 
not designated first (e.g., flag IS_FIRST is not set); 
(d) when this node is a nested HTML <td> tag, and 
this node has few children; (e) when this node is an 
HTML <td> tag but not a nested table, and this node 
is not a dominant node; (f) when this node is not 
substantially text, and this node is not an HTML <td> 
tag, and this node is not a dominant node; 



ANALYZE 
CHILDREN 



30 



Do not present anything 
at this (parent) node; yet, 
continue with first child 
node of parent node. If 
parent has content, build 
sub-tree and move 
content to children 
nodes. 



(a) when this node is an <huiil> tag (e.g., a page 
having multiple <html> tag pairs); (b) when this node 
is an HTML <body> tag; (c) when this node is a 
nested HTML <td> tag, and this node is has many 
children; (d) when this node is an HTML <td> tag, 
but not a nested table; and this node is a dominant 
node; (e) this node is not substantially text, and this 
node is not an HTML <td> tag, and this node is a 
dominant node; 
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A method for presenting derivative content according to various aspects of the present 
invention includes any method comprising one or more steps of identifying styles suitable for audio 
and/or visual presentation; presenting summarized, selected, annotated, and restated content; or 
simultaneously acquiring and analyzing user input during presentation for processing an interjected 

5 command from the user. A method in an implementation called from step 306 of method 300, 

discussed above, begins by identifying one or more styles that may be suitable for the information to 
be presented. For example, in the system discussed above with reference to FIG. 2, voice browse 
process 216, having received content (e.g., PAGE2) in a markup language (e.g., VoiceXML), may 
identify styles (step 602) from the content (e.g., in-line or referenced) or identify suitable default 

1 0 styles. Default styles may be identified by processes performed by voice browser server 123 with 
reference to indicia of the type of audio device 202 which may be implied or explicit in prior 
communication (e.g., registration of device 202 and its user with the authority providing voice 
browsing services; or at step 242 by additional codes or cookies accessible to voice browser server 
123), Style information may refer to a predefined or user defined dictionary of terms with instructions 

1 5 on preferred pronunciation. For particular terms, instructions may direct that the term be spelled 
instead of being pronounced. 

After styles have been identified for application (or have been applied), two parallel 
execution paths may be supported. First, the recitation and/or display of derivative content (step 604) 
is directed to the user's device (e.g., 202). 

20 Second, while presentation is in progress, input (e.g., audio or actuation of keypad 

switches) is received and accumulated for recognition (e.g., speech recognition or multiple key 
sequence recognition) (step 606). If the input is recognized as including a command (step 608), 
recitation and/or display may be interrupted (step 610). An acknowledgement of the command may 
also be presented. The context of the command (e.g., the time the conrnaand was initiated, or the 

25 position in the presentation (e.g., TTS queue or content being displayed)) is noted. 

When the presentation is complete, when no interjected command has been identified, 
and when the interjected command and its context have been noted, a return to the calling process 
may be effected. Indicia of the interjected command, if any, and its context may be returned as well. 

A message sequence provided by processes operating in a system according to 

30 various aspects of the present invention may provide a user interface for a device having a display of 
limited visible area and/or resolution (herein called a limited display device). A limited display user 
interface, inter alia, permits user access to information stored on a network (e.g., navigation of the 
Internet by following hypertext links). For example, in message sequence 700 of FIG. 7, processes 
cooperate to make a request for information and to present information received. For clarity of 

35 explanation, the Internet and its conventional protocols perform the functions of network 130; 
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information is requested by following a link; and information is received in the form of a web page. 
Message sequence 700 presumes that a connection oriented link or a connectionless link (e.g., 171, 
173, or 175) is already available (formed as discussed above or in any conventional manner) for 
communication and that information with a link (e.g., a hypertext link) is currently being displayed on 
5 limited display device 702. 

Limited display device 702 represents any access device having, inter alia^ a display 
of smaller area or less resolution compared to a conventional monitor of workstation 111. Examples 
of limited display devices include (a) wireless device 1 12 which may have a text only display, a 
monochrome text and graphics, or a pocket size color display; (b) telephone 113 which may have a 

10 video display panel having a diagonal measurement of less than 8 inches (10 cm); and (c) cell phone 
114 which may have a display similar to those discussed in (a) and (b). Limited display device 702 
for clarity of explanation is assumed to have minimal computing capability — merely sufficient for 
performing I/O process 706. I/O process 706, performed by limited display device 702, accepts user 
input demanding access to the information addressed by the hypertext link and sends (step 730) a 

15 suitable request to browse process 708. 

In an alternate implementation, limited display device 702 may have computing 
capability sufficient to perform browse process 708 (e.g., Internet Explorer marketed by Microsoft for 
operation on wireless devices). If so, then browser server 704 may be omitted with concomitant 
changes to message sequence 700. 

20 Browse process 708 includes any process capable of participating in the conventional 

communication protocols associated with network 130 (e.g., TCP/IP, WAP, HTTP, etc. for wireless 
devices and/or the Internet). Browse process 708 may be performed by browser server 704. For 
limited display devices of the type described above with reference to telephone 113 and cell phone 
114, browse process 708 may be performed by voice browser server 123 in place of browser server 

25 704. Analogous to the discussion above, the functions of browser server 704, transcoder proxy server 
125 and ISP server 124 may be performed by any number (including one) of computers at any number 
of physical locations. 

The functions performed by MASK database 230, browse process 708, transcode 
process 218, route process 222, and serve process 224 may correspond to the functions described 

30 above for messages relating to URL3, URL4, PAGE3, and PAGE4 (i.e., steps 746-760 of FIG. 7 

correspond to steps 266-280 of FIG. 2). URL3 may include identification of browse process 708 so 
that transcode process 218 can prepare PAGE5 (in place of PAGE4 of step 280) in a form suitable for 
use by browse process 708. 

Browse process, on receipt of information (e.g., PAGE5) fi-om transcode process 218 

35 (step 760) prepares information (e.g., message PRESENTATION) directing limited display device 
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702 to make a suitable display corresponding to the information requested (step 730). PAGE4 may be 
in a markup language (e.g., HTTP, XML, or WML). PRESENTATION may be conveyed via a 
proprietary protocol adapted for limited display device 702 (e.g., conforming to device 702's 
manufacturer's specifications). Tailoring for the size and/or resolution of the display of limited 
5 display device 702 may be accomplished by transcode process 218 as directed by guidance from 

database 230; and/or by browse process 704, Preferably, identification of the limited display device 
702 and/or browse process 704 as received with URL3 (e.g., as discussed above with reference to step 
246) is used by transcode process 218 to obtain guidance fi-om database 230 so that PAGES requires 
merely application of one or more conventional style sheets to provide a suitable PRESENTATION. 

10 In other words, if the requested page exceeds the complexity suitable for limited display device 702, 
PAGES may include a member of a set of pages, as discussed above. 

A method for transcoding, according to various aspects of the present invention, may 
include accessing guidance, accessing primary content, and preparing derivative content by applying 
the guidance to the primary content. Applying guidance may include parsing the guidance to identify 

15 directives, and executing the directives. Directives may conform to a language specification such as a 
markup language. Parsing may include reading guidance, identifying language syntactic elements, 
recognizing symbols both built-in and user-defined, and ascribing meaning to the symbols in 
accordance with the semantics of the language. The ascribed meaning may be conveyed to an 
executive process on the completion of parsing a symbol or directive (e.g., the parsing process and the 

20 executive process may cooperate in a manner similar to a conventional interpreter) or conveyed as an 
object code for execution at another time separate from the time of parsing. Guidance may include 
directives stated in a markup language of the type conforming to SGML. Descriptions of SGML are 
available from the International Standards Organization as ISO-8879. For example, an 
implementation of the MASK language described herein may conform to SGML. A parsing process 

25 in one implementation includes the fimctions of a conventional parser for parsing XML. 

In one implementation, primary content, guidance, and derivative content are each 
expressed as a sequence of characters (e.g., all or a portion of a sequential file; or all or a portion of a 
record of a database). A sequence of characters conforming to a markup language generally includes 
a series of elements. Each element includes either (a) one tag; or (b) a pair of tags surrounding an 

30 enclosure. Syntax (e.g., reserved characters) distinguishes tags from enclosure which may be text or 
any number of elements. A tag may be an empty tag, a single tag, a start tag or an end tag. A tag 
includes syntax (e.g., and ">") and a tag-identifier. A tag may not include an element between its 
"<" and syntax. The syntax encloses one tag-identifier describing the type of tag. A start tag 
includes its tag-identifier and may also include one or more attribute-value pairs. A single tag further 

35 includes syntax indicative of a single or empty tag (e.g., a before the end-of-tag syntax). An 



37 



46806.00007 



empty tag is similar to a single tag but has no attribute-value pairs. An end tag includes the same tag- 
identifier as used in a corresponding start tag and includes syntax indicative of an end tag (e.g., a 
immediately following the start-of-tag syntax). For an empty tag or single tag, any semantic meaning 
that may be associated with the tag-identifier may be asserted on all directives following the tag. In 

5 this first respect, guidance is subject to language processing in a manner similar in some respects to 
the linear sequential interpretation and execution of program statements of a conventional procedural 
programming language (e.g., FORTRAN). Such an empty or single tag may have no further influence 
beyond an end tag when the empty or single tag is enclosed by a pair of tags that mclude the end tag. 
For a pair of tags, the semantic meaning that is associated with the tag identifier may be asserted on 

10 the enclosure (if any). In other words, the scope of assertion is bounded by the pair of tags. When 

paired tags are properly nested, the influence of outer enclosing tags is cumulative on all enclosures to 
any depth. In this second respect, guidance is subject to language processing and executed in a 
manner similar in some respects to block structure for defining the scope of a symbol in a 
conventional procedural programming language (e.g., ALGOL and its progeny). In a preferred 

] 5 implementation, primary content and derivative content are expressed in conventional XML and 
guidance is expressed in a different language, for example, MASK having some features similar in 
some respects to XML (e.g., in MASK, the tag distinguishing syntax and composition of empty, 
single, start, and end tags is common to XML). 

Significantly, among other things, MASK includes tags for conveying semantic 

20 meanings that cannot be easily conveyed (if at all) using conventional XML. MASK also uses syntax 
and variable types not found in XML. 

According to various aspects of the present invention, guidance is captured, 
maintained, and implemented using a markup language herein called MASK. A partial list of 
elements of the MASK markup language is presented in Table 6. 

25 Guidance may be provided in a record or file to a transcoding process. Generally such a 

record or file includes at least one set of <mask> tags (e.g., <mask> and </mask>). Between such tags a 
context may be defined in a manner that supports scope of references. A reference generally is a symbol 
that identifies by semantic and/or syntactic context a property of the symbol to be used in further 
processing. For example, a current value (e.g., any suitable property) is substituted for the name of a 

30 variable (e.g., any declared symbol). Semantic rules define the scope of a reference so that language 

processing of context is deterministic. It is desirable to assure that identification of a suitable property is 
deterministic, including where the symbol has been overloaded (e.g., same symbol declared in different 
contexts, or intended polymorphism). Scope rules may take into account sequential and/or hierarchical 
context. Context may include semantic state that precedes or surrounds a particular occurrence of the 

35 symbol. For example, semantic state may be intended by a programmer (e.g., nested control and/or data 
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structures set forth in the guidance) and later determined by the transcoding process in any manner (e.g., 
language processing by a stand alone parser, a parser cooperating as part of an interpreter, or a parser 
cooperating with a compiler). 

Guidance, according to various aspects of the present invention, may include semantic or 
^ syntactic elements (e.g., tags) that are reserved (e.g., predefined by a guidance language definition), 

declared (e.g., by a declarative function of the guidance language), or determined (e.g., take a value that is 
determined by one or more declarative functions that v^ere performed before a meaning of the semantic or 
syntactic element is to be used). For example, the guidance language may include the reserved tags and 
syntax described in Table 6. 

10 

The MASK language includes two general variable types: string and tree. Declaration of 
a string variable is generally accomplished with the single tag <defvar />. Operations on a string variable 
are directed by elements including, inter alia, <assign>, <substring>, and <translate>. Strings may have 
numeric value (e.g., "a temperature of 99.8 degrees" may have the decimal value 99.8) and be subject to 
^ ^ arithmetic operations. Arithmetic operations are directed by elements including, inter alia^ <expr>. 

String variables (including those having numeric values) may be analyzed using elements including, inter 
alia, <cond>. The value of a string variable may be used as the name of another variable of any type 
using a prefix. The value of a string variable may be used as the name of a subroutine using the 
element <call>. The value returned fi-om an evaluation may control program flow using the elements 
<ifi> and <call>. The value of a string variable may be designated for output using the element 

20 

<variable>. 

Declaration of a tree variable is accomplished generally with the pair of tags <defvar></ 
defVar>. Operations on a tree variable are directed by elements including, inter alia, language processing 
of its value with <eval>; repeating operations on each node with <for_each>; combining the tree with 
25 current guidance with <include>; combining the tree (or a portion of the tree) with current primary 

content with <module>, <description>5 and <selection>; pruning with <node>; creating a named node in 
the tree with <element>; inserting content into the tree with <header> and <tail>; and designating the tree 
(or a portion of the tree) for output with <description>, <selection>, and <value_of>. Tree variables may 
be analyzed using elements including, inter alia, <defined> which returns a conditional value, <nodestat> 
which returns a numeric value, <value of> which returns a string value, and <regex> which returns a tree 
value. A portion of a tree may be identified as a current context for further processing using elements 
including, inter alia, <node>, <for_each>, and <traverse>. A tree may be used for communication with 
another server using elements including, inter alia, <module> <selection> and <postfield>. 

In Table 6, the following symbols in italics are used to describe information specified by 
3 5 the programmer: (1) string indicates a character string literal that may be enclosed in quotation marks 
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and must be enclosed in quotation marks if it includes white space; (2) number indicates a string value 
consisting of digits that may be interpreted as having a numeric value; Q)n,m indicates a pair of numeric 
values; (4) node jd indicates a absolute or relative specification of a node of a tree (e.g., "1-1-0-0-5" or 
"«"); (5) name indicates a symbol that has been declared; a name symbol is imderstood to be a reference 
5 to a target; (6) user-defined indicates a name that has been declared as the name of a subroutine; (7) 
attribute-name indicates a name that has been declared as an attribute in a subroutine declaration; (8) 
XPathspec is a string that specifies a set of references as defined by the XPath subset of XML described 
by W3C; and (9) text indicates data that may include any characters and elements including <elements> 
not reserved in the MASK language. 

10 

TABLE 6 





Guidance Tag 


Purposes of the Language Element and Functions Performed 
by a Language Processor 


15 


PROCEDURAL ELEMENTS 


Procedural elements affect the state of parsing guidance. The 
serial execution of guidance may be altered by procedural 
elements to effect performance of conditional processing, 
exception handling, repetition, recursion, and call and return 
behaviors. 


20 
25 


<mask> ...</' mask> 
version=5/ri/2g 


Encloses MASK language elements for language processing. 
The start tag invokes an instance of a MASK language processor; 
the end tag terminates this instance of language processing and 
returns to the enclosing execution environment. The version= 
attribute is followed by a string, e.g., "1 .2" that identifies the 
version of the MASK language for which language processing is 
desired. 


30 




Perform the operations of the start and end tags of an <output 
file=stdout> element upon encountering the start and end tags of 
the <mask> element if an <output> element is not explicit. Scan 
the <mask> enclosure for subroutine information and create a 
subroutine stack frame. Initialize the variable and parameter 
frames. Initial frames may include values for conventional 
environment switches. 


35 


<eval> ... </ eval> 


Characters between the tags are subject to language processing 
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10 



15 



20 



25 



30 



35 



Guidance Tag 


Purposes of the Language Element and Functions Performed 
by a Language Processor 




according to the MASK language of the current version. See 
Example 1 below. Used for macros where subroutine declaration 
may be less readable. 

At the start tag, invoke an instance of a MASK language 
processor (e.g., having state and context different from the 
current language processor). At the end tag, terminate the 
instance. All enclosing contexts are reachable for finding target 
values for references. 


<if>..,</if> 


Encloses a <cond> element, and at least one of a <then> element 
and an <else> element all at the same level. Effects a branch to 
the <then> element or to the <else> element as a result of 
evaluating the conditional expression specified by the <cond> 
element. 

Note the position of the end tag for use by the <then> and the 
<else> elements. 


<then>... </then> 


If the <cond> element at this level returns a *true" value, the next 
guidance node is set to the node of the start tag of the <then> 
element at the same level. At the end tag, the next guidance node 
is set to the node of the end tag of the <if> element. The 
enclosure of a <then> element may include any elements 
including, for example, <if> elements and declarations of 
variables and subroutines. 


<else> .,.</ else> 


If the cond> element at this level returns a "false" value, the next 
guidance node is set to the node of the start tag of the <else> 
element at the same level. At the end tag, the next guidance node 
is set to the node of the end tag of the <if> element. The 
enclosure of an <else> element may include any elements 
including, for example, <if> elements and declarations of 
variables and subroutines. 


<loop> ... </ loop> 
var^string 


Encloses elements that will be repeated with the variable 
specified by the var= attribute. The var= attribute will be given 
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20 



25 



30 



35 



Guidance Tag 


Purposes of the Language Element and Functions Performed 
by a Language Processor 


min=number 
mecK=number 
inc~number 


an initial value specified by the min= attribute, incremented by 
the value specified by the inc= attribute with each loop iteration, 
and given a final value specified by the max= attribute. 

At the end tag, the next guidance node is set to the node of the 
start tag. At the first visit to the node of the start tag, declare a 
loop counter variable named in accordance with the value of the 
var= attribute, if it does not already exist at this level. Assign the 
value of the min= attribute to the loop counter variable. At 
subsequent visits to the node of the start tag, add the value of the 
inc= attribute to the loop counter variable and then if the result 
exceeds the value of the max= attribute, assign the node 
following the node of the end tag to the next guidance node. If 
the var= attribute is not specified, a unique loop counter is 
maintained by the transcoding process. 


<sleep /> 

houvs=number 

mmutQs=}7umber 

seconds=«wm6er 

milliseconds=number 


Delay further processing until the amount of time specified by the 
sum of the attribute values has lapsed. 


<break/^ 


This element is used within the enclosure of an element 
specifying repetition or recursion, (e.g., a <loop>, <for_each>, or 
<subroutine> element) to interrupt repetition or recursion. 

Set the next guidance node to the node that would receive control 
if the enclosing repetition or recursion element had completed 
normally. 


<for_each> ...</ for_each> 
SQlQCtr^XPathspec 


Use the value of the select= attribute as a node specification to 
obtain a set of references to nodes of the current tree to be 
processed. XPath syntax may be used as described by W3C. 

Repetitively obtain a next node from the set of references. Use 
each next node as a current node context for each repetition of the 
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Guidance Tag 


Purposes of the Language Element and Functions Performed 






by a Language Processor 







elements in the enclosure until no next node can be obtained. 




<traverse> ... </ traverse> 


Specify a subtree in any convenient manner and direct processing 


5 


sc\QCtr=XPathspec 


of the nodes of that subtree. Any combination of attribute values 




id=nodeid 


may be used, except, only one of the select= attribute and the id= 




sp=(w, m) 


attribute may be specified. The value of the select^ attribute may 




keymasks=^^r/«g 


use XPath syntax as described by W3C to provide additional 






members of a set of candidate node references. The reserved 


10 




word '%is" may be used for node id to indicate the current node 






as may be determined by enclosing elements. The id^, sp=, and 






keymasks= attributes provide values to be used for identifying 






one or more candidate nodes references. See the discussion of 






the <node> element for usage and language processing. Use the 


15 




<traverse> element enclosure to describe processing that applies 






to each node subordinate to the designated node. A combination 






of <traverse>, <if>, <cond>, and <nodestat> elements may be 






used to search a subtree for nodes that meet particular criteria. 


20 




Determine the "best" node from the set of candidate node 






references and use it as the current node for the elements in the 






enclosure. Determine the "best" node as discussed above with 






reference to find node process 424. The "best" node is 






understood as the root of a subtree to be traversed. Act on each 


25 




child node of the subtree according to the elements in the 






enclosure. 




<user-defined> ... </ iiser-defined> 


Call a subroutine that is declared anywhere in the current <mask> 




attribute-name -string ... 


element as expanded by <include>, <module>, and <eval> 






elements. The name of the subroutine and the names of its 


30 




attributes (if any) are defined by the user in the <subroutine> 






declaration element. The call may alternatively be a single tag or 






an empty tag (e.g., a <user'defined l> element). Provide values 






for attributes and an enclosure for use by the subroutine. 


35 




Litroduce any output supplied by the subroutine (e.g., text or tree) 
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Guidance Tag 


Purposes of the Language Element and Functions Performed 






by a Language Processor 






as a return value into the calling element in the place of the 


5 




<iiser-defined> element. 






Transfer control to the subroutine corresponding to the tag- 






identifier of the start tag. Find the target node via a name search 






of the subroutine stack. Search the current frame of the 






subroutine stack from the current level to the root level; then 


10 




search the each preceding frame from the level at which the 






current context was begun. Push a new frame onto the variables 






stack and a new frame onto the parameters stack. An identifier of 






the node at this level following this user-defined element may be 






posted in the frame on the parameter stack to be used as the next 


15 




guidance node when the subroutine returns. Attribute-value pairs 






(if specified) and the enclosure (string or tree, if specified) are 






posted in the new parameter stack frame and so passed to the 






subroutine to be available via the <parameters> element. See 


20 




Examples 9 and 10. 


<call> ... </call> 


Call a subroutine as described above with reference to the <mer' 




nwxQ=string 


dej}ned> element where the string value of the name= attribute 




attribute-name— string ... 


provides the name of the <iiser'defined> tag. The string value of 






the name= atttibute may be calculated and provided as an indirect 






reference, (e.g., using the prefix as discussed with reference 


25 




to the <variable> element). 






Processing including the return value is analogous to the <user'' 






defined> element 




INPUT ELEMENTS 


Input elements may identify sources of symbols for declarative 


30 




elements; data for analytical and output elements; and/or further 






guidance for procedural elements. 




<mclude /> 


Use <include> to insert existing MASK language elements (e.g., 




url=5^/ng 1 #name 


<module> or <subroutine> elements) or content (e.g., text) into 






the current <mask> element. 


35 
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Guidance Tag 



10 



Purposes of the Language Element and Functions Perfonned 
by a Language Processor 



Using the url= attribute value as a Uniform Resource Locator 
(URL), access a page in accordance with the URL. Parse the 
page as guidance and insert the resulting tree after the current 
node in the guidance tree. 

Subroutines that are declared in the included page are noted on 
the subroutine stack. Language processing proceeds with nodes 
of the included page in-depth first order before the remainder of 
the guidance tree. 



<module /> 
\xx\=^string \ #name 



15 



20 



25 



30 



35 



Identifies an Internet address for accessing data (e.g., to be 
inserted in a default, variable, or output tree) or processes (e.g., 
for <postfield> elements). The url= attribute specifies a 
conventional Uniform Resource Locator (URL) for read or post 
operations; or, when preceded by the name of a tree variable 
for read and/or write operations. Data returned from the Internet 
address may be parsed (e.g., an HTML or XML document) and 
inserted as a subtree after a current node of an object tree 
determined from the context of this <module> element. 
Elements in the enclosure may be processed in a context that 
includes the data or processmg identified by the url= attribute. In 
other words, targets for references may be found in the data 
addressed by the url=attribute. The enclosure of the <module> 
element may include any MASK elements, including other 
<module> elements. 

Usmg tfie url attribute value as a Uniform Resource Locator, 
access a page in accordance with the url value. Parse the page as 
content and insert the resulting tree after the current node in the 
implied object tree (e.g., the default output tree, or a tree 
variable). If the first character of the url= attribute value is 
the remainder of the url= attribute string identifies a variable 
name having a tree value to be parsed and inserted. If the url= 
attribute specifies a program as opposed to data (e.g., a 
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Guidance Tag 



10 



Purposes of the Language Element and Functions Performed 
by a Language Processor 



conventional ".cgi" object on another server), processing of the 
start tag may include establishing communication with that server 
(or site) and its capabilities or programs in any conventional 
manner (e.g., coimection oriented functions, log-on, discovery, 
reservation, security certifications, initialization). Likev^ise, at 
the end tag, communication may be disestablished in any 
conventional manner (as needed). 



DECLARATIVE AND 
ANALYTICAL ELEMENTS 



15 



20 



A declarative element informs the language processor of a 
symbol that may be used elsewhere in the guidance. The 
declaration may precede or follow a reference to the declared 
item (e.g., subroutine and skeleton declarations may be identified 
in an overview step of language processing before other 
language processing begins). A declaration identifies a target for 
a reference that is within the scope of the reference. Such a target 
may supercede a prior declared target until the prior context is 
restored. An analytical element may be an assignment or an 
expression that provides a value . The value may be binary (e.g., 
conditional expression), numeric, string, tree, subtree, executable 
(e.g., MASK elements), text (e.g., HTML or other elements not 
part of MASK), a combination of these, or otherwise as defined 
below. 



<sklt>...</sklt> 



25 



Encloses the definition of a skeleton. 

Note the location of the tree provided in the enclosure for 
determining a designated node in accordance with the sp= 
attribute of other MASK elements. 



30 



<subroutine> ... </ subroutine> 
name=5/rmg 



35 



Encloses a declaration of a process and may additionally enclose 
the declaration of one or more parameters, constants, and/or 
variables. The name= attribute specifies the name of the 
subroutine for the purpose of calling the subroutine. The 
declaration of the process may include any MASK elements. 
When the process is a function returning a value, the name 
associated with the subroutine may be used in any semantic 
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Guidance Tag 


Purposes of the Language Element and Functions Performed 






by a Language Processor 






context where a value may be properly used ~ the returned value 






being intended. When the process is a procedure having no 


5 




returned value (or returning a null value), the name associated 






with the subroutine may be used in any semantic context where a 






procedural or declarative statement may be properly used ~ the 






performance of the procedure in sequence with other procedural 






and declarative statements being intended. Subroutines may be 


10 




declared in a hierarchical manner (e.g., properly nested elements, 






each element having a pair of tags). A subroutine defines a local 






context for its enclosed declarations. 






Declare the enclosure as the body of a subroutine referred to by 


J 15 




the value of the name= attribute. Insert the body of the 






subroutine into the guidance tree at the end of the guidance tree 






and update the subroutine stack with the name and location of the 






subroutine body. 




<parameters /> 


Within a <subroutine> or <module> declaration, tiie 


•r 20 


select= @attribute-name \ * 


<parameters> element provides access to attributes and the 






enclosure specified in the subroutine call (e.g., the <call> element 






or the <user-defined element). The <parameters> element may 






be used in any semantic context where a value may be properly 






implied ~ the value being made available (e.g., by copy or by 


25 




reference) to the subroutine via the parameters stack. If the value 






of the select?= attribute is preceded by "@", access to an attribute 






of the call is provided. If the value of the select= attribute is 






access to the enclosure of the call is provided. 


30 




Pass parameters to a subroutine using one of two methods: by 






value and by reference. A literal value may be passed as an 






attribute (e.g., a number) or as an enclosure (e.g., texf). See 






Example 4. The value of a variable may be passed as an attribute 






(e.g., a string variable) or as an enclosure (e.g., a tree variable). 


35 




In one implementation values are made part of the top frame of 
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Guidance Tag 


Purposes of the Language Element and Functions Performed 




by a Language Processor 




the variables stack. See Example 5. 




For pass by reference, pass a string value corresponding to the 




name of a variable that has been declared. In the subroutine 




declaration local pointer variables are defined and used with the 




prefix, indicating an indirect reference to the underlying 




variable. See Example 6. 




In an alternate implementation, the name of each parameter is 




used to declare a local pointer variable in the top frame of tiie 




parameter stack; the pointer value referring to the variable passed 




by reference (e.g., the name of that variable). 




At the single tag, determine to which attribute or the enclosure 




access is desired. Search the parameters stack for a named 




attribute. Search the current frame of the parameter stack firom 




the current level to the root level; then search each preceding 




frame firom the level at which the current context was begun. In 




this way, access is provided to parent parameters. If the 




enclosure is to be accessed, use the enclosure in the current 




parameters stack frame. 


<defined> ... </defined> 


This element returns a *true" result when the enclosure, if subject 




to MASK language processing, would return or have a non-null 




value; otherwise returns "false" (e.g., no language elements 




unique to the MASK language were found in the enclosure). 




Scan the enclosure for a <mask> element and if present return 




"true'' else return "false". 


<defVar /> 


Use this element to declare a string variable named in accordance 


name=stri?2g 


with the value of the name= attribute. The value of the value= 


vahxe=string 


attribute (if specified) is assigned to the new string variable. The 




declaration is local to this level. 
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Allocate storage space for the variable (e.g., in the current frame 
of the variable stack or on the heap). Record an association 
between the specified name and the allocated space. For 
example, the association may be maintained in a conventional 
symbol table. The variable stack may be organized as a linked 
list of data structures, each data structure for storing the 
association. The data structure itself or the heap may provide 
space for the value of the variable. 


<defvar> ... </ defv^ar> 
name=string 


Declare a tree variable named in accordance with the value of the 
name= attribute. The enclosure (string or tree) is assigned as the 
value of the new tree variable. 

Allocate space and record an association as discussed above with 
reference to <defvar />. 


<description> ,..</ description> 


When not part of a <module> element, <description> identifies 
that elements in the <description> element enclosure will be 
operating on the default tree (e.g., an output tree for derivative 
content). When part of a <module> element, <description> 
identifies that elements in tiie <description> element enclosure 
will be operating on a copy of the content identified by the 
<module> element. Generally, this element encloses one or more 
<node>, <header>, and/or <tail> elements to give a more 
readable MASK language for operations on the node(s) identified 
by the <node> elements. 

In one implementation, no action is taken on either the start tag or 
the end tag. In an alternate implementation, if the start tag is 
within the scope of a <module> element, and the <module> 
element refers to a source to which write access is denied (e.g., a 
common Internet site on the World Wide Web), a copy of the 
content identified by the <module> element may be made as an 
underlying variable to support read, write, insert, and remove 
operations. 
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Encloses <node> elements thereby designating nodes read from a 
default input tree (e.g., primary content or content identified by a 
<module> element) for inclusion in a default output tree (e.g., an 
output tree for derivative content). 



10 



<node /> 

se[QCtF=XPathspec 

id=node_id 

sp=(«, m) 

keymasks=^/r//?^ 

remove 



[20 



25 



30 



35 



Encloses a node identification and may enclose elements that 
operate on the node so identified. The <node t> single tag may 
be used as desired in place of a pair of <node> ... </node> tags. 
The <node f> single tag may be used to remove a node (i.e., 
prevent it and its subtree, if any, from being included in an output 
tree) by simply including the remove attribute in the single tag. 
The <node> element may be used in any semantic context where 
a string value is implied (e.g., the string value of the node content 
if any is provided) or where a tree value is implied (e.g., the 
subtree having the identified node as a root is provided). The tree 
to which the node belongs may be determined by context: an 
input tree as discussed above with reverence to the <selection> or 
<module> elements; or a variable or output tree as discussed 
above with reference to the <module url=#name> or 
<description> elements. 

The select=, id=, sp==, and keymasks= attributes provide values 
from which a set of candidate node references is determined. The 
node identified by the <node> element is a "best" node selected 
from the set of candidate node references in a manner discussed 
above with reference to process 424. Generally, only one of the 
select= attribute and the id= attribute is specified. The value of 
the select= attribute may be any XPath location path (e.g., for 
this designated node, for this designated node and all its 
children, etc.) as described by W3C. The id= attribute is 
followed by a node number in the form, e.g., "0-1-0-0-2-3", as 
used in XML. The sp== attribute is followed by the position of the 
node in the skeleton, e.g., from the n*^ character through the m^. 
The keymasks= attribute is followed by a text string to be located 
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in the implied input content. When followed by the 








subsequent characters may specify relative identifications (e.g., 


5 






":<-<" means the desired node is the grandparent of the specified 








node). 








Determine a set of candidate node references from all attribute- 








value pairs. Determine the "best" node from the set of candidate 


10 






node references and use it as the current node for the elements in 








the enclosure. Determine the "best" node as discussed above 








with reference to find node process 424. Consider as the current 


•S 






node the "best" node so determined. If the remove attribute is 


ffi 






specified, mark the current node for omission from the output 


^715 






tree. If the remove attribute is not specified, mark the current 








node for transfer to (or inclusion in) the output tree. 


!C 


<node> ... 


</node> 


Encloses a node identification and elements that operate on the 




selectr=XPathspec 


node so identified. All attribute-value pairs of the single <node> 




\d=node_id 


f Iptnpnt rlicr*iiccf*r! nV^nvp m5i\/ hp ncpH "vi/ifTi tlip T\*\\Tf^A <^r\c\i\ei^ 


ry2o 


sp=(«, ni) 




element described here. The paired <node> element may be used 




keymasks=5/r/>zg 


in any semantic context where the single <node> element may be 




remove 




used. The start tag may be used to remove a node as discussed 








above by simply including the remove attribute in the start tag. 


25 






Processing is the same as for the <node> single element. In 








addition, provide the current node identification for use in 








processing all elements of the enclosure. 




<header> 


... </header> 


Generally used within a <description> and further within a 








<node> element; but may also be used with <module>, <defvar>. 


30 






and <element> elements or in any semantic context where an 








insertion into a tree is desired. The enclosure may be a literal 








value or a reference (e.g., variable name) to a string or tree value. 








For example, if the enclosure includes an HTML reference such 








as <a href="www.myart.bin" > text<d>^ the content so identified 


35 






together with the text (if any) is inserted in the current node. For 
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5 




a tree value, insertion is before the current node. For a string 
value, insertion is at the beginning of the string value at the 
current node. 


10 




At the start tag, build a temporary variable to retain the content to 
be inserted. Perform all enclosed elements to determine a value 
for the temporary variable. At the end tag insert the value of the 
temporary variable as described above. 




<tail> ... </tail> 


Same as "^Cheader^ element ex cent thpit the rjoint of insertion fnr a 
tree is after the current node and for a string is at the end of the 
string value at the current node. 




<e]ement /> 
nmnQ=strmg 


Use this element to create a node in a tree to which write access 
is permitted. 


nj2o 




Create a node after the current node in the output tree (e.g., a 
default tree or a tree implied by an enclosing element such as 

'^iimodiile^*''^ and ffive it a name flip value rif fVif* natrif^^ 5itlTiV*iif^* 

The node has no enclosed value; but may be assigned a value, for 
example, using the <header> element. 




<element> ... </ element> 
namG=string 


Same as <element> single tag, except that a value (string, or tree) 
is provided in the enclosure to be assigned to the new node. 


25 




At the start tag, build a temporary variable to retain the content to 
be inserted. Perform all enclosed elements to determine a value 
for the temporary variable. At the end tag insert the value of the 
temporary variable as described above. 


30 


<value_oC> ...</> 


Return the string value obtained from a tree value (specified by 
the enclosure) by ignoring all of the tags in the tree and 
concatenating all content in depth first order beginning at the root 
of the tree. 


35 


<regex> .,.</ regex> 
pMem^strmg 


Return a tree that includes (a) the number of times the value of 
the pattern attribute occurs in the enclosure, and (b) a list of start 
and end positions where the match was found. The pattern may 
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include syntax described above with reference to regular 
expressions. The result tree may contain tags (e.g., <match> and 
<pos>) to identify positions. See Example 7. 


<variable l> 
name=string 


Return the value of the most local occurrence of the named 
variable. A variable may have as a value the name of another 
variable. The "$"sign preceding the name of an indirect variable 
indicates that the value of the indirect variable is the name of the 
variable whose value is to be processed. For example, a fu*st 
variable used as $POINTER returns 6 as an indirect reference to 
the value of a second variable, COUNT; when COUNT has a 
value of 6 and POINTER has a value of "COUNT". 

Find the target node via a name search of the variable stack. 
Search the current frame of the variable stack from the current 
level to the root level; then search each preceding frame from the 
level at which the current context was begun. Return the value of 
the named variable (or its indirectly referenced value). 


<assign> ... </ assign> 
naine=5/r/«g 
type= plus 1 cat 


Declare a string variable named in accordance with the value of 
the name= attribute if it does not already exist. Liitially give the 
variable a zero or null value if it does not already exist. The 
declaration is local to this level. If no enclosure is provided, the 
enclosure is presumed to be zero or null. When no type is 
specified, the named variable is given the value of the enclosure. 
When the type attribute is 'plus', the enclosure is converted to a 
numeric value and added to the current value of the named 
variable. When the type attribute is 'cat', the enclosure is 
concatenated to the string value of the named variable. 

If not declared, allocate space for the variable as discussed above 
with reference to <deR^ar />. Set the value of the variable to the 
specified or calculated value. 


<substring> ... </ substring> 
min-number 


Return a string being a portion of the enclosure beginning with 
the character at a position indicated by the specified mm number 
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m3X=number 


and continuing through the character at the position indicated by 
the specified max number. 


<translate> ... <f translate> 

VQplsLC^string 
option= i 1 g 


Replace an undesu-ed substring (specified by the value of the 
pattern attribute) with a desired substring (specified by the value 
of the replace attribute) in a given string specified by the 
enclosure. When the value of the option attribute is 'T, ignore 
case in finding matches. When tiie value of the option attribute is 
"g", replace all occurrences. Otherwise, the comparison is case 
sensitive and only the first occurrence (if any) is replaced. 


<xsl:template> ... </ xsl:template> 

match=5/r/«g 

name-string 

pnonty=^number 

modG=strmg 


Use this element to defme a template rule according to XSLT as 
described by W3C. 

Process the rule in accordance with XSLT. 


<xsl:apply-templates> ... <xsl:apply- 

templates /> 

solQCtr^XPatkspec 

mode=5/r/wg 


Used this element to invoke a template rule according to XSLT. 
Process the invocation in accordance with XSLT. 


<expr> ... <y expr> 

eval= plus 1 minus | times | over 


This element generally encloses two or more argument (i.e., 
<arg> elements) that each provide a value (e.g., name a variable, 
parameter, define a constant, or call a subroutine that provides a 
return). If the first of two arguments provides a variable name, 
the result is assigned to that named variable. If three arguments 
are provided, the first must be a variable name, which variable 
receives the result of applymg the operation (specified by the eval 
attribute) to the second and tfiird arguments. 

Calculate the sum, difference, product or quotient and assign the 
result to the specified variable. 
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<cond> ... <cond> 

eval= gt 1 It 1 eq 1 or 1 and | not | plus | 

minus | same 


Generally follows the <if> start tag. Specifies the condition for 
the branch. The enclosure includes the arguments (e.g., <arg> 
elements) to be evaluated. An element having a "true" or 
"false" value (e.g., a defined element or a variable element) 
may be enclosed without more. When the eval= attribute is 
specified, the enclosure provides suitable argument elements 
for the operation. The eval= attribute is followed by one of the 
following: 




gt 


A numeric evaluation: if argl greater than arg2 




It 


A numeric evaluation: if argl less than arg2 




eq 


A numeric evaluation: 
if argl is equal to arg2 




or 


Logic operators that may be used to combine other 




and 


conditions. 




not 






plus 


An arithmetic operation that may be used to combine 




minus 


arguments. 




same 


A comparison of identity of string arguments: 
if argl same arg2 




Calculate the "true" or "false" result and provide it for 
processing, generally for use in an <i£> element. Numeric results 
may be deemed "true" if zero and "false" otherwise. 


<arg> ... </ arg> 


Encloses an argument definition used for example in the 
conditional expression of a <cond> element. 

The enclosure is to be used as a member of an expression 
element or a condition element and not as implied output. 


<environment /> 
nam&=strmg 


Returns a value of the environment variable specified by the 
value of the name attribute. 



35 
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<nodestat l> 
sehctr^XPatkspec 
idr^nodejd 
sp=(n, ni) 
keymasks=5/r/?2g 
stat_type= title | num_bytes | 
num_bytes in links | num_words | 
num tables | num children | 
num_liiiks | nuin_words_in links | 
node_id | content_type | (^attribute- 
name 
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Represents a result of a statistical evaluation of the content of 
the specified node. The node may be specified as discussed 
above with reference to the <node /> element. A conditional 
expression <cond> element may refer to <nodestat /> as an 
argument in connection with any other argument for effecting a 
comparison. The first such comparison may be nested within a 
second conditional expression <cond> construction to compare 
the result to a threshold or limit. The stat_type= attribute is 
followed by one of the following: 



title 



num_bytes 



num words 



num tables 



num links 



num words in links 



Returns the text string comprising 
the first sentence of the node. 



Returns the number of bytes in the 
text of the node. 



Returns the number of words in the 
text of the node. 



Returns the number of tables of the 
node. 



Returns the number of links of the 
node. 



Returns the number of words of the 
links of the node. 



The return value is a number when the specified type refers to (a) 
the number of bytes in the content (i.e., text value or enclosure) at 
the designated node, (b) the number of bytes used in all hypertext 
links in the content at the designated node, (c) the number of 
words separated by white space in the content at the designated 
node, (d) the number of tables defined in the content at the 
designated node, (e) the number of children nodes of the 
designated node, (f) the number of hypertext links in the content 
at the designated node, and (g) the number of words separated by 
non-alphabetic characters appearing in all hypertext links in the 
content at the designated node. The return value is a string when 
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the specified type refers to (a) the value of the title attribute at the 
designated node, (b) the node identifier (e.g., a string such as "0- 
0-1-2-1") of the designated node, or (c) the value of the node type 
attribute at the designated node. Wlien stat_type specifies the 
name of an attribute, the return value may be a number, string, or 
tree in accordance with the value of the named attribute at the 
designated node. 

Return the value indicated by the type of node statistic specified 
by stat_t3^e. 


OUTPUT ELEMENTS 


Output elements generally provide data as derivative content. 
Output may also constitute guidance for further processing (e.g., 
automatically prepared script for transcoding any primary 
content, including the current content), or activate another 
process running on a server or client. 


text 


Text appearing at the top level (enclosed only by the <mask> 
element) is assumed to be specified for output. Text appearing as 
an enclosure of an element that does not consume the text is 
assumed to be specified for output. An assignment or 
designation is generally intended to use (i.e., consume) the text 
without implying the same text is to be output. Any element 
having a start tag that is not reserved to MASK or is not defined 
as a subroutine is passed to the output without parsing its 
attributes or enclosure. Such an element is expected to be 
properly formed. 

Text is concatenated to any current value at the current node of 
the output tree, unless enclosing tags dictate another destination. 


<commen1> ... </ comments 


A conmient node is added after the current node of the output 
tree, unless enclosing tags dictate another destination. The new 
comment node value is taken fi*om the enclosure. 


<postfieId> ... <;/postfield> 


Use the enclosure of this element and the url= attribute of the 
required enclosing <module> element to prepare and send a 
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message to another server via a suitable protocol (e.g., Simple 
Object Access Protocol (SOAP) as marketed by Microsoft). See 
Examples 2 and 3 belovv^. 



10 



15 



<output> ... <l outputs 
fi\Q-string 



Output is presumed to be intended for the standard output by 
default. To override this default, a filename or I/O designation 
(e.g., the server operator console) may be specified using the 
file= attribute. Nested <output> elements may be used to 
designate output to various destinations while retaining a parent 
context. 

Create an output tree representation for the specified file. When 
the end tag is encountered, traverse the tree depth first to provide 
a character stream for conventional character output. When the 
stream has been transferred, close the specified file. 



A system and method for preparing guidance, according to various aspects of the 
present invention, presents the user with a graphical user interface with which the user may retrieve 
information form a computer network and define guidance to be associated with the retrieved 
information. Guidance may then be used for an audio user interface or a limited display user interface 
as discussed above. For example, when the computer network includes the Internet, the user may 
operate a browse process to retrieve a web page and define guidance of the type described above. The 
user may perform operations (e.g., mouse clicks) in a graphical environment to direct preparation of 
guidance by automatic text generation. Text that is generated may be in a markup language, for 
example the MASK markup language as discussed above. 

A method for defining guidance, according to various aspects of the present 
invention, includes any process that includes, inter alia, selecting a portion or region of information, 
annotating selected information, and/or creating a description of the information (e.g., a skeletal 
description) for use in associating guidance with the selected information. For example, message 
sequence 800 of FIG. 8 may be accomplished by workstation 111, transcoder proxy server 125, and 
any site server 142. These devices have been described above including serve process 226 performed 
by any site server 142. Browse process 832, performed by workstation 111 may include any 
information requesting and presenting program suitable for network 130. When network 130 includes 
the Internet, browse process 832 may include any conventional Internet browser (e.g., Internet 
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Explorer marketed by Microsoft). MASK edit process 834, performed by transcoder proxy server 
] 25 may be constructed using any conventional programming technology suitable for use on network 
1 30. For example^ MASK edit process 834 may be developed using known languages and interfaces 
including C++, PERL, CGI APIs, JAVA, JavaScript, and XWindows, to name a few. 

5 Guidance defined during the edit session will define what and how information will 

be presented in derivative content. Derivative content is derived fi*om primary content (e.g., for which 
the model page is a prototype) by application of guidance defined during an edit session. 

Message sequence 800 will now be described assuming network 130 includes the 
Internet. In response to user input, browse process requests a page (step 802) from MASK edit 

1 0 process 834. The address of the requested page may be known by the user, or the user may be guided 
to the appropriate page by a hypertext link in another page that is known by the user. The URL of the 
requested page may include (in any conventional manner) indicia of a model page to be edited. 

On receipt of the request from browse process 832, MASK edit process 834 requests 
(step 804) the model page (or a defauh page) from any site server 142. Serve process 226 returns the 

15 desire model page (step 806) in any conventional manner (e.g., in a markup language with animations, 
streaming audio and/or video). 

MASK edit process 834 prepares a presentation in accordance with a portion of the 
model page (step 808) and sends the presentation (e.g., EDIT PAGE) to browse process 832, The 
presentation may be made less complex in any manner as described above (e.g., as directed by 

20 guidance from database 230) or in any conventional manner. The presentation may be made using the 
audio user interface or the limited display user interface described above, wherein predefined 
guidance is associated with the EDIT PAGE. Preferably, efficient development of guidance for the 
model content may be obtained using a presentation that includes one or more of the elements 
schematically shown in screen 900 of FIG. 9 and described in Table 7. 

25 

TABLE? 



30 



Reference 


Description 


Alternatives 


Designation 






Panel 902 


Provides uninterrupted access to editor 


The functions of panel 902 may be 




commands, statements that have been 


provided by any combination of 




manually or automatically developed, and 


conventional drop-down menus, tool 




entries for annotations. 


bar buttons, and/or dialog boxes 






having conventional tabbed pages of 






widgets. 
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Panel 904 


Presents any desired portion or region of the 
model page, e.g., for defining the scope or 
subject matter of an editor command. 


Panel 904 and any number of portions 
of simultaneous model pages may be 
presented in any combination of 
conventional tiled (split screen), 
framed, and windowed display 
technologies. Scroll bars may be 
added for panning images having a 
scale too large for presentation in their 
entirety in one tile, frame, or window. 


Text box 906 


Provides visibility and entry/edit ftmctions 
for statements or arguments, e.g., elements, 
tags, attributes, values, and syntax in the 
MASK language. 


Presentation may be as a wrapped text 
string, outline formatted to illustrate 
the paired and hierarchical scope of 
tags, or illustrated graphically, e.g., 
using flow chart symbols. 


Text boxes 
908 and 910 


Provide entry/edit functions for annotations. 


Entry may be made in box 906 
consistent with the presentation 
alternatives for box 906 discussed 
above. 


Buttons 912, 
914,916, 
918, 920, 
921, and 922 
activated, 
e.g., by a 
mouse click. 


DESCRIPTION - permits entry/edit of text 
and parameters in box 906 between 
automatically generated <description> ... 
</description> tags as discussed above. 
Typically, the description section includes 
<node>, <header>, and <tail> tags for 
annotation. 

SELECTION - permits entry/edit of text and 
parameters in box 906 between 
automatically generated <selection> ... 
</selection> tags as discussed above. In 
addition, when various check boxes 
described below are asserted, the content 
associated with each may be automatically 
specified in box 906. The selection section 


Tool bar buttons, dialog box buttons, 
or menu items may be used in 
combination or substituted for any or 
all buttons. Additional tool bar 
buttons, dialog box buttons, and/or 
menu items may be implemented for 
any MASK element, attribute, 
constant, or syntax to increase 
input/edit efficiency of often used 
characters. 
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typically includes one or more <node> tags. 
REMOVE - features or text appearing in the 
model page may be designated as not to 
appear in derivative content. When various 
check boxes described below are asserted 
for a remove action, the "remove" parameter 
is automatically added to the associated 
<node> tag. 

SELECT ALL - same as SELECT, though 
no prerequisite check boxes need be 
asserted. 

ENFORCE - Enters "keymasks-" 
arguments in <node> tags corresponding to 
content in asserted check boxes. Each word 
in the content gives rise to a separate 
"keymasks=" specification. Relative 
operations (e.g., "<-") are added to conform 
to the "id=" argument. 
SKELETON - Creates a description of the 
entire model content in text box 906 as a 
suitable <sklt?^ ... </sklt> entry. Enters 
precalculated "sp=" arguments in <node> 
tags for items having asserted check boxes. 
SAVE - Saves the contents of box 906 in 
database 230 and terminates the edit 
process. May include eflfectively asserting 
the SKELETON button prior to SAVE. 




Check boxes 
932 and all 
boxes of 
similar 
appearance 


A mouse click in a check box selects the 
associated elements for operation by an edit 
command. Box 932 selects the 2-row table 
934, Boxes 936-942 select cells in table 
934. A selected cell may have a variety of 
content elements, e.g., row 942 includes a 


Selection may be accomplished by a 
mouse-drag to paint selected items or 
to enclose selected items in a perimeter 
formed during the drag. 
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text box 944, a button 946, and a link 948. 





5 Whenever the user has completed entry of text in a text box (e.g., 906, 908 or 910), or 

activates a button (e.g., 912-922, or alters the assertion of one or more check boxes, a message 
describing the action(s) taken may be sent to MASK edit process 834 (step 810). The message may 
be in the form of a URL with parameters. MASK edit process 834 may respond with an updated edit 
page (step 812) that presents panel 902 cleared and ready for fiirther operations and panel 904 updated 

10 to show the effect of accumulated annotations, if any. These two operations (steps 810 and 812) are 
repeated as many times as desired. In response to user assertion of button SAVE 922, browse process 
832 may send (step 814) a message indicating termination editing is desired. MASK edit process 834 
may then post guidance to MASK database 230 (step 816) and send an acknowledgement (e.g., a 
predefined page) to browse process 832 (step 818). 

15 Prior to terminating. Mask edit process 834 may prompt the user to specify 

information used for accessing the guidance resulting from editing. For example, the user may be 
prompted to provide information corresponding to one or more of NAME, DESCRIPTION, 
USER_ID, MODEL_ADDRESS, APPLIES_TO_ADDRESS, HAS_REGULAR_EXPRESSION, or 
IS_FIRST, as discussed above. Edit process 834 may also supply values by analysis or default. 

20 Guidance may be stored as one or more records in database 230 in any conventional 

manner. Guidance may be indexed according to the URL of the model content. Guidance may be 
indexed by a URL comprising a regular expression derived from the URL of the model content so that 
the guidance is indicated as applicable to all primary content URLs that matches the regular 
expression. Guidance may be stored or accessed in a hierarchical manner that maps portions of the 

25 URL to directories (or folders) having subdirectories (or subfolders). A method for accessing 
guidance may include the following steps: 

1 . Search for the URL exactly as presented (e.g., the PC URL of FIG. 4); if found quit, 
otherwise continue. 

2. Truncate the last (child) portion of the URL. For example, "http://www.news.comAJS/2000- 
30 OCT-30.htm" may be truncated to "http://www.news.com/US/". If the truncated URL is found, quit; 

otherwise continue. 

3 . Search for a sibling of the truncated URL of step 2 and if found, use the guidance associated 
with the sibling; otherwise, continue. In the example of step 2, this step 3 would use 
"http://www.news.com/US/[.] 

35 4. Truncate the URL keeping only the scheme and the first domain designation. If the truncated 



62 



46806.00007 



URL is found, quit. Otherwise, no guidance is available. For example, derivative content 440 is 
prepared by the cooperation of analyze process 406 and reduce complexity process 408. 

The following sequence of user inputs also described in Table 8 would produce 
guidance for model content (e.g., selected and annotated content) represented by the schematic display 
5 of FIG. lOA. "Click" means that the mouse left button is pressed and released when the mouse 
pointer is located over the indicated feature. 



1 . Click in box 950 to indicate that the entire table 950 is to be the subject of a subsequent 
operation. 

10 2. Click button 914 SELECTION to cause table 952 to appear in the derivative content. 

3. Click in box 980 to indicate that one cell of table 950 will be the subject of a subsequent 
operation. Click button 916 REMOVE to omit box 980 from derivative content for table 952. 

4. Click in boxes 956 and 958; type "FOR" in PREAMBLE box 908; and click button 9 1 2 
DESCRIPTION so that the word "FOR" will be presented (e.g., recited in audio) before the 

15 remaining content in each indicated cell. 

5. Click in boxes 968 and 970; type "CHANGE IS" in PREAMBLE box 908; and click button 
912 DESCRIPTION so that the words "CHANGE IS" will be presented (e.g., recited in 
audio) before the remaining content in each indicated cell. 

6. Click in boxes 974 and 976; type "AT" in PREAMBLE box 908; type "POINTS" in 

20 POSTAMBLE box 910; and click button 912 DESCRIPTION so that the word "AT" will be 

presented before and the word "POINTS" will be presented after the remaining content in 
each indicated cell. 

7. Assert only box 954 and then click button 920 ENFORCE. "Keymasks=" arguments will be 
added to descendent nodes of table 952 in the description section and/or the selection section. 

25 8. Click on button 921 SKELETON. A skeleton of the model content will be generated. An 
"Sp=" argument will be added to each node in the description and/or the selection section. 
9. Click button 922 SAVE to teminate editing and save the text in box 906 as guidance. 

Guidance may also include data corresponding to the text in box 906. For example, the 
skeleton may be saved in a form that includes an association between node names and skeletal 

3 0 characters of the type described above with reference to records 415. 



35 
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TABLE 8 



10 



25 



30 



35 



Step 


Text in Box 906 After Performing the Step 


Comment 


1 


<mask> <description> </description> <selection> </selection> </mask> 


Box 906 
maybe 
initially 
empty. 


2 


<mask> <description> </description> <selection> <node id="0-0-l"> </node> 
</selection> <mask> 


Because 
box 932 
was not 
asserted, 
table 934 
will not 
appear in 
derived 
content. 


3 


<mask> <description> <node id="0-0- 1-3-1" remove> </node> </description> 
<selection> <node id^"0-0-r'> </node> <yselection> </mask> 


One cell is 
removed. 


4 


<mask> <description> <node id="0-0-l-l-0"> <header> FOR <;/header> 
</node> <node id="0-0-l-2-0"> <header> FOR </header> </node> <node 
id- '0-0-1-3-1" remove> <node> </description> <selection> <node id="0-0- 
!"> </node> <yselection> </mask> 


Preamble 

annotation 

added. 


5 


<mask> <description> <node id="0-0-l-l-0'> <header> FOR <header> 
</node> <node id="0-0-l-2-0"> <header> FOR <yheader> <node> <node 
id="0-0-l-l-r> <header> CHANGE IS </header> </node> <node id="0-0-l- 
2-l"> <header> CHANGE IS <;aieader> </node> <node id="0-O- 1-3-1" 
remove> </node> </description> <selection> <node id="0-0-l"> </node> 
</selectio!i> <ymask> 


Preamble 

annotation 

added. 


6 


<mask> <description> <node id="0-0-l-l-0"> <header> FOR <;^eader> 
</node> <node id="0-0-l-2-0"> <header> FOR <header> </node> <node 
id="0-0-l-l-l"> <header> CHANGE IS <;aieader> <ynode> <node id="0-0-l- 
2-I"> <header> CHANGE IS <yheader> <ynode> <node id="0-0-l-l-2"> 
<header> AT </header> <tail> POINTS </tail> ^nodO <node id="0-0-l-2- 
<header> AT </header> <tail> POINTS <ytail> <node> <node id="0-0-l- 


Preamble 
and 

postamble 

annotations 

added. 
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5 



10 



15 



25 



30 



35 



Step 


Text in Box 906 After Performing the Step 


Comment 




3-1" remove> </node> <;/description> <selection> <node id="0-0-l"> </node> 
</seIection> </mask> 




7 


<mask> <description> <node id="0-0- 1-1-0" keymasks= "MARKETS: <-<-<- 
1-0 SNAPSHOT: <.<-<-l-0" > <header> FOR </header> <node> <node 
id="0-0- 1-2-0" keymasks- "MARKETS: <-<-<-2-0 SNAPSHOT: <-<-<-2-0" > 
<header> FOR <header> <node> <node id="0-0-l-l-l" keymasks= 
"MARKETS: <-<-<-!-! SNAPSHOT: <-<.<-l-l" > <header> CHANGE IS 
</header> <ynode> <node id="0-0- 1-2-1" keymasks= "MARKETS: <-<-<-2-l 
SNAPSHOT: <-<.<-2-l" > <header> CHANGE IS <header> </node> <node 
id="0-0-l-l-2" keymasks= "MARKETS: <-<-<-l-2 SNAPSHOT: <-<-<-l-2" > 
<header> AT <header> <tail> POINTS <ytail> <ynode> <node id="0-0- 1-2-2" 
keymasks= "MARKETS: <-<-<-2-2 SNAPSHOT: <-<-<-2-2" > <header> AT 
<;/header> <tail> POINTS </tail> </node> <node id="0-0- 1-3-1" keymasks= 
"MARKETS: <-<-<-3-l SNAPSHOT: <-<-<-3-l" remove XnodO 
</description> <selection> <node id="0-0-l" keymasks= "MARKETS: <-<-< 
SNAPSHOT: <-<-<" > </node> </selection> <ymask> 


Adds 
keymasks. 


8 


<mask> <description> <node id="0-0- 1-1-0" keymasks= "MARKETS: <-<-<- 
1-0 SNAPSHOT: <-<-<-1.0" sp= "(27,29)" > <header> FOR <header> 
<ynode> <node id-"0-0- 1-2-0" keymasks= "MARKETS: <-<-<-2-0 
SNAPSHOT: <-<-<-2-0" sp= "(38,40)" > <header> FOR <header> </node> 
<node id="0-0-l-l-l" keymasks= "MARKETS: <-<-<-l-l SNAPSHOT: <-<- 
<-l-l" sp- "(30,32)" > <header> CHANGE IS </header> <node> <node 
id="0-0- 1-2-1" keymasks= "MARKETS: <-<-<.2-l SNAPSHOT: <-<-<-2-l" 
sp= "(41,43)" > <header> CHANGE IS </header> <node> <node id="0-0-l-l- 
2" keymasks= "MARKETS: <-<-<-l-2 SNAPSHOT: <-<-<.1.2" sp= "(33,35)" 

> <header> AT <;/header> <tail> POINTS </tail> <node> <node id="0-0-l-2- 
2" keymasks= "MARKETS: <-<-<-2-2 SNAPSHOT: <-<-<-2-2" sp= "(44,46)" 

> <header> AT </header> <tail> POINTS </tail> </node> <node id="0-0-l-3- 
1" keymasks= "MARKETS: <-<-<-3-l SNAPSHOT: <-<-<-3-r' remove 
sp="(52,54)" > </node> <ydescription> <selection> <node id- '0-0-1" 
keymasks= "MARKETS: <-<-< SNAPSHOT: <-<-<" sp= "(20,64)" > </node> 
<;/selection> <sklt> HPTRD%dD%dD_drRD_drt 

TRD%drRD%dD#dD#drRD%dD#dD#drRD%dD%drRD%dD%drtph<ysklt> 


Adds 

skeleton 

and 

references. 
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Step 


Text in Box 906 After Perfonning the Step 


Comment 




</mask> 





As a result of the above command sequence, only table 952 will appear in the 
derivative content v/hen primary content similar to model content of panel 904 is transcoded. Table 
952 will be presented with rows 1082 and 1084 revised for a more understandable audio recitation. 
For example. The recitation based on FIG. lOA may include: "Markets Snapshot. For Dow, change is 
minus forty nine and sixty four hundredths at eleven thousand one hundred sixty nine and fifty 
hundredths points. For 'naz-dack', change is minus one hundred one and forty hundredths at five 
thousand forty two and one hundredth points." The pronunciation of NASDAQ may be specified in a 
dictionary as described above. 

When a set of pages has been or will be prepared on demand to present via an audio 
user interface or a limited display user interface, transcode process 218 may include additional links 
to each page. For example, FIG. lOB depicts the display of a limited display device showing 
derivative content formed in accordance with guidance as discussed above with reference to FIGs. 9 
and lOA; and, fi"om primary content from a different date than the model content. Display 1090 
includes content 1091, auxiliary links 1092, and switch link 1094. Auxiliary links 1092 provide 
access to other pages (for news content, weather content, and sports content) derived from the same 
primary content (e.g., by summarization as discussed above, or by user definition of guidance for each 
auxiliary page). In addition, following link 1094 may facilitate transfer of session control as 
described below with reference to FIG. 11, In an audio presentation, auxiliary links may include 
recited control functions such as a menu of items discussed above with reference to Table 2. 

The limited display of FIG. lOB provides about 6 lines each with a line length of 
about 45 characters. If the display provided a line length of less than 45 characters, the lines as shown 
may be wrapped onto additional lines. The limited display device may have a buffer for retaining 
more lines than fit on the display screen and provide user operated scrolling controls to effectively 
move different portions of the buffer onto the display for presentation. 

Predetermined guidance may be applied to primary content to produce derivative 
content. As discussed above, model content may be used to develop the predetermined guidance. 
Predetermined guidance may include a description of the model content, for example, a skeletal 
description in the MASK markup language. In addition, a description of any primary content may be 
prepared and used: (a) to identify whether particular predetermined guidance that has been associated 
with similar model content is to be used with the primary content; and (b) to prepare the derivative 
content. The first use may be accomplished by comparing the description of the primary content with 



66 



46806.00007 



a description of the model content. The second use may be accomplished by aligning the description 
of the model content to the description of the primary content. 

For example, in method 304 of FIG. 4, preparation of derivative content may be 
performed by transcoder proxy server 125 as part of transcode process 218. To prepare derivative 
content based on primary content, according to various aspects of the present invention, a skeletal 
description of suitable model content is located, for example, with reference to a URL of the primary 
content and a URL that includes a regular expression as discussed above. 

After the primary content has been received, a skeletal description of the primary 
content may be generated. For example, primary content somewhat similar to model content 
schematically presented in panel 906 of FIG, 9 is presented in HTML in Table 9. Typical primary 
content is of considerably greater quantity and complexity. Note that panel 906 may be prepared for 
readability by, inter alia, redacting the actual news stories, inserting table lines, and adding check 
boxes for selecting various features for convenience of preparing guidance. 



TABLE 9 
Primary Content in Markup Language 
<html> <body> ~ ^ ~ =— = = _ ^ ^ 

<table border> 

<tr> <td colspan=l> <b>HEADLINE STORY: Authorities in several countries have expressed 
interest in adopting a world-wide uniform monetary system. The United States has not yet taken a 
supportive role, though delegates from the Federal Reserve Board are attending. Informed sources 
say a strictly metric quantification (1-10-100) is suggested to replace the 1-5-10-20-50-100 system 
used in the United States. <lh> </td> 

<td colspan=l> <b>TECHNOLOGY: Powerful lasers have been aimed at the moon in an 
experimental communication system hoped to replace broadcast and satellite television, according to 
researchers at the Lunar Labs Consortium. Proponents say the system will provide a hierarchical 
organization of television entertainment with links between shows to similar shows and related work 
by the same actors and actresses.<b> </td> 

<td colspan=l> <a href=freeaccess.html> FREE INTERNET ACCESS </a> </td> </tr> 
<tr> <td colspan=3> <form action=search.pl> <input type^text name=search> 
<input type^submit name=submit value=search> <a href=advanced_search.pl> 
ADVANCED SEARCH</a> </fonn> </td> </tr> </table> 
<table border> 

<tr> <td colspan=3 align=center> <b>MARKETS SNAPSHOT</b> </td> </tr> 
<tr> <td colspan=l>DOW</td> <td colspan=l>-49.64</td> 

<td colspan=l>l 1169.50</td> </tr> 
<tr> <td colspan=l>NASDAQ</td> <td colspan=l>-101.40</td> 

<td colspan=l>5042.01</td> <tr> 
<tr> <td colspan=l>LAST UPDATE AT 03/24 12:23pm</td> 

<td colspan-l>SPONSORED BY XTRADE</td> </tr> 
<tr> <td colspan=l> <form action=stockquotes.pl> <input type=text name=quote> 

<input type=submit name=q value=QUOTE> </form> </td> 

<td colspan=l>TRACK WITH NEWSCORP</td> </tr> 
</table> <ybody> </html> 
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Using the MASK markup language, a description of the primary content of Table 9 is 
presented in Table 10. White space has been added to clarify the comparison and generally would not 
appear in the skeletal descriptions. 



TABLE 10 
Description of Primary Content 

<sklt> 
HB 

TRD$dD$dD_%dr 
10 RDFII_%fdr 
TRD%dr 

RD%dD#dD#dr 

RD%dD#dD#dr 

RD%dD%dr 

RDFII%fdD%drt 
bh 

1 c </sklt> 



The MASK markup language identifies non-structural elements (e.g., table cell 
contents, text, numbers, links, graphics, etc.) with reference to numbered nodes of the primary content 
(and similarly for the model content). Numbered nodes do not appear explicitly in the markup 
language of the primary content, but they are apparent from analysis of the nesting level of the tags 
used for the content. A pair of tags enclosing content denotes a node of the tree. The root and all 
nodes at the same level are numbered from zero, left-to-right. Examples of nodes are illustrated in 
Table 11. 



TABLE 11 



35 



Content 


Node Identification 


<html> 


0 


<body> Contributors 


0-0 


<table> 


0-0-0 


<tr> 


0-0-0-0 


<td> Abel </td> 


0-0-0-0-0 


<td> Baker </td> </tr> <tr> 


0-0-0-0-1 


<td> Cook </td> 


0-0-0-1-0 


<td> Dickson </td> </tr> </table> 


0-0-0-1-1 


<table> 


0-0-1 


<tr> 


0-0-1-0 


<td> Egan </td> 


0-0-1-0-0 


<td> Fromme </td> </tr> <tr> 


0-0-1-0-1 


<td> Gable </td> 


0-0-1-1-0 


<td> Harris </td> </tr> </table> <body> </html> 


0-0-1-1-1 
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Using the node numbering as in Table 1 1, the non-structural content "Dickson" could 
be copied to the derivative content with reference to its node number "0-0-0-1-1". Because the node 
numbering of primary content is subject to change (e.g., a new table is added to the body) content to 
be copied to the derivative content may be identified with reference to other structural and non- 
structural features of the primary content. In the model content shown in FIG. lOA, for example, the 
word "DOW" may be used (instead of the words MARKETS SNAPSHOT) to locate the row 982, 
1082 in which the numeric value of the Dow Jones Industrial Average is likely to be found. Using 
references relative io row 982, node names for other rows and for the entire table 952 may be 
determined in the model content for use with primary content (e.g., by process 428). Node names 
with relative references may be more likely to align to future (e.g., different) primary content. 

According to various aspects of the present invention, reference to content to be 
copied to derivative content is made with reference to one or more anchors. An anchor may be 
located by aligning some or all of a skeletal description as discussed above, or by aligning any one or 
more elements (e.g., a text string "DOW", a passage of an expected number of words, a number of 
links, a link having an expected number of words, etc., as facilitated by parameters of the node 
guidance tag of Tables 4 A and 4B). 

Table 8 step 8 provides an example of guidance that includes model content as 
discussed above with reference to an edit session and FIG. lOA. Note that each anchor is identified 
by a specification between <node></node> tags. In this example, multiple keymasks are defined in 
addition to a node number and skeletal position. Transcode process 218 locates the desired content 
for inclusion in derivative content in accordance with any one or more of the anchor specifications. 

Derivative content may be prepared for presentation by a browse process of a 
workstation 1 1 1 (see Table 12), for presentation using an audio user interface and an audio device 202 
(see Table 13), or for presentation using a limited display device user interface and a limited display 
device 702 (see Table 14). Derivative content shown in Tables 11-13 was prepared using primary 
content of Table 8 and guidance of Table 8 step 8 and therefore includes only the features of table 952 
of FIG. lOA. For example, cell 980 is omitted. 

TABLE 12 

Derivative Content for a Workstation GUI 

<html> <table border> 

<tr> <td colspan="3" align="center" > <b> MARKETS SNAPSHOT </b> </td> </tr> 
<tr> <td colspan="l"> FOR DOW </td> <td coIspan="l"> CHANGE IS -49.64 </td> 
<td colspan-"l"> AT 1 1 169.50 POINTS</td> </tr> 

<tr> <td colspan="l"> FOR NASDAQ </td> <td colspan-"l"> CHANGE IS -101.40 </td> 
<td colspan="l"> AT 5042.01 POINTS </td> </tr> 

<tr> <td colspan=="r'> LAST UPDATED AT 03/24 12:23 PM </td> </tr> 
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^ Derivative Content for a Workstation GUI 

<tr> <td colspan^"l"> <form action= 

"/cgi-bin/mask_selection_ie.pl?url=http://defurl/stockquotes.pr' method="post"> 
<input type="text" name="quote" > <input type="submit" name="q" value-"QUOTE" > 
</form> </td> 

<td colspan-" r '> TRACK WITH NEWSCORP </td> </tr> 
5 </table> <yhtml> 



Male and female voices are used in the audio user interface to indicate the difference 
between information and a link. A link may be announced in a female voice. When the link is 
follov^ed, the announcement of the content may restate the same words for orienting the user; 
however, the restatement may be in a male voice. For example, when "form 1" is first encountered as 
a link, a female voice announces it. When a user says "form 1" to follow the link, a male voice may 
acknowledge that the link was followed and may announce "form 1" as a title before reciting the 
content of form 1 . 



25 TABLE 13 

Derivative Content for Audio User Interface 

<?xml version="1.0"?> <vxml version="1.0" > <form> <block> <prompt> 
<pros pitch="male" > MARKETS SNAPSHOT </pros> 
<pros pitch="male" > for </pros> 
<pros pitch="male" > DOW </pros> 
<pros pitch="male" > change is </pros> 
20 <pros pitch="male" > -49.64 </pros> 
<pros pitch="male" > at <ypros> 
<pros pitch="male" > 11169.50 </pros> 
<pros pitch="male" > points </pros> 
<pros pitch="male" > for </pros> 
<pros pitch="male" > NASDAQ </pros> 
<pros pitch="male" > change is </pros> 
<pros pitch="male" > -101.40 </pros> 
<pros pitch="male" > at </pros> 
<pros pitch-"male" > 5042.01 </pros> 
<pros pitch="male" > points </pros> 

<pros pitch="male" > last updated at 03/24 12:23pm </pros> 
<pros pitch="female" > form 1 </pros> 
<pros pitch="male" > TRACK WITH NEWSCORP </pros> 
</prompt> </b]ock> 
30 <link next="#form_l " > 

<grammar type="appiication/x-jsgf' > form 1 </grammar> </link> </form> 

<form id=="form l " > <block> <prompt> 

<pros pitch="male" > form 1 </pros> </prompt> </block> 

<field name="quote" > </field> 

<block> 

<submit next="/cgi-bin/mask_selection_ie.pl?url=http://defurl/stockquotes.pl" /> 
<^lock> </form> </vxml> 
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TABLE 14 

Derivative Content for Limited Display User Interface 

<?xml version="1.0"?> 

<!DOCTYPE wml PUBLIC "-//PHONE.COM//DTD WML 1.1//EN" 
5 "http://wwv^.phone.com//dtd/wmIl 1 .dtd"> 
<wml> <head> 

<meta http-equiv="Cache-Control" content="max-age=60" forua="true" /> 
</head> 

<card id="none" > 

<onevent type="onenterforward" > 

<refiresh> 

<setvar name="defurlroot" value="/cgi-bin/mask_selection_ie.pl?url=" t> 
10 </refresh> 
</onevent> 

<p> <table columns="3" > 

<tr><td><b> MARKETS SNAPSHOT <^><td></tr> 

<tr> <td>for DOW</td> <td> change is -49.64 </td> <td> at 1 1 169.50 points </td> </tr> 
<tr> <td> for NASDAQ <V^td><td>change is -101.40</td><td>at 5042.01 points </td> </tr> 
<tr> <td>last updated at 03/24 12:23pm</td> </tr> 
^ 5 <tr> <td> TRACK WITH NEWSCORP </td> </tr> 
</lable> </p> 

<do type="accept" label="formO" > 

<go href="#formO" > </go> </do> <ycard> 

<card id="formO" > 

<do type=-"accept" label="submit" > 

<go href="$(defurlroot:noesc)http://defurl/stockquotes.pl" method="post" > 
<postfield name-"quote" value="$quote" /> </go> </do> 
20 <p> <input type^"text" name="quote" t> </p> </card> </wml> 

According to various aspects of the present invention, a user having more than one 
access device (or a device suitable for use with more than one user interface) may from time to time 
use two or more access devices (or user interfaces) in series or in parallel. The user may provide input 

25 via any of the user interfaces discussed above (e.g., workstation GUI, audio user interface, or limited 
display user interface) and effect system operation with one or more of these interfaces from that point 
forward. For example, a user may begin a session accessing the Internet via a wireless device 1 12 or 
702 and a limited display device user interface, indicate to the browse process 708 that the session is 
to continue in audio, and possibly indicate to the audio user interface (e.g., voice browser 216 as 

30 discussed above) that the session is to resume on the limited display device. In an alternate 

implementation, session switching may include control from time to time by a workstation 111 GUI 
832. 

During a transfer of control, one or more of these access devices or user interfaces 
may have exclusive control of the session, or may have nonexclusive control (e.g., an input on any 
35 user interface is treated as an input for browsing and an output is provided in parallel on all access 
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devices). Output may be provided exclusively to the interface being used in an exclusive manner, or 
may be provided as indicated by the user on two or more interfaces. 

For example, message sequence 11 00 of FIG. 1 1 may be used to facilitate transfer of 
session control exclusively from a limited display device user interface to an audio user interface and 

5 back to the limited display device user interface. In this sequence, one operator uses a web phone 

11 02 comprising an audio access device 202, a limited display device 702, and processing capability 
to support browse process 1 105, call process 1 108, and I/O process 206 (to the extent not 
implemented in circuitry). Web phone 1 102 is initially linked as a wireless device 1 12 of FIG. 1 to 
Internet 130 by link 171, wireless gateway 121, link 172, ISP server 124, and link 179. In an alternate 

10 implementation, web phone 1 102 may be linked as a wireless device 1 12 to Internet 130 by link 171, 
wireless gateway 121, link 174, transcoder proxy server 125, and link 178. Web phone is shown 
already participating in a session with audio/LDD browse process 1 106 on browser server 1 104. 
Browser server 1 104 in alternate implementations includes the functions described above with 
reference to voice browser server 123 and/or browser server 704. An audio/LDD browse process 

15 1 1 06 supports both an audio user interface and a limited display device (LDD) user interface as 
discussed above. 

Browse process 1 105 responds to user inputs (e.g., arrow keys or pen-based events). 
The user may navigate the Internet in the conventional manner selecting links using user inputs. At 
any time, the user may desire to continue the session via an audio user interface. To do so, the user 

20 may follow a link (step 1 120) provided on the limited display device by audio/LDD browse process 
1 106 (e.g., link 1094 of FIG. lOB for the current content in audio, or link 1096 of FIG. lOB for an 
Internet radio site). The link may include a URL corresponding to a TO AUDIO command. 
Alternately, limited display device may have circuitry, software, or accept user inputs to provide a 
command TO AUDIO to audio/LDD browse process 1 106. The command may conform to a URL, 

25 e.g., the URL of browser server 1 1 04. Browse process 1 1 06 may respond to the TO AUDIO 

command by obtaining information from web phone 1 102 sufficient to confirm authorization for the 
audio session and configuration information for establishing a proper audio connection. Such 
information may be available via one or more cookies or URL parameters from web phone 1 102 or 
from registration data obtained by browse process 1 106 when the initial user session with browse 

30 process 1 106 began. A login scenario, as discussed above, may also be conducted by browse process 
1 1 06. When the TO_AUDIO command is a URL, a parameter of the URL may identify a particular 
page to be delivered in audio (e.g., the same information as on the current page, another page based on 
the same primary content, an Internet radio site, a listing of predetermined audio sites, etc.). An 
identifier of the particular page (e.g., URL9) may be stored (step 1 121) in association with user 

35 information 1110 (e.g., a usemame, caller ID, origination phone number, or registration information). 
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After confirmation that the switch to an audio user interface is authorized and proper, 
browse process 1 106 may notify the user with a suitable acknowledgement (step 1 122). The 
acknowledgement may include a link and a phone number to call to establish an audio session. In one 
implementation, the phone number is part of the link in the WML markup language (e.g., <a 

5 href="wtai://wp/mc; laaaeeennnnn">TO AUDIO </a> where wtai is a scheme of the type known as 

Wireless Telephony Application Interface; "wp/mc" specifies a library of WTAPublic and a command 
"mc" to make a call; and "aaaeeennnn" would be a 10-digit US telephone number). 

The user may follow the link (step 1 123) which effects a telephone call (step 1 124) 
fi-om web phone 1102 (and call process 1 108) to browser server 1 104 (and browse process 1 106). For 

10 example, browse process 1 106 may send an applet to browse process 1 105 with the acknowledgement 
(step 1 122). Browse process 1 105 may then initiate a phone call fi-om web phone 11 02 to browser 
server 1 104 and may provide a user identification (step 1 125). Browse process 1 106 may use 
conventional caller ID technology to authenticate the user (step 1 125). 

In an alternate implementation, command TO AUDIO causes browse process 1 105 

15 to prepare for an inr.oming telephone call, and audio/LDD browse process 1 106 (having sufficient 
authorization to proceed) to effect a telephone call to web phone 1 102. 

Regardless of whether browse process 1 106 or browse process 1 105 initiates the 
phone call, web phone 1 102 is eventually connected as cell phone 1 14 of FIG. 1 via link 175, cellular 
gateway 122, and link 176 through voice browser server 123 acting as browser server 1 104, Voice 

20 browser server 123 is coupled to the Internet 130 via link 177, transcoder proxy server 125, and link 
178. 

As indicated by the user in the TO_AUDIO command or in any conventional manner, 
the recitation in audio of an appropriate initial page is provided from audio/LDD browse process 1 106 
to I/O process 206 performed by audio device 202. For example, the user of limited display device 

25 702 may have supplied URL9 as stored with user information 1110. Audio/LDD browse process 
1 106 may use the caller ID or other user information to recall an identification of desired content 
URL9 (step 1 126). Browse process 1 106 may transcode primary content of URL9 as discussed above 
or obtam streaming audio content from Internet 130 and pass it to I/O process 206 as RECITAL9 
(step 1 127). User of limited display device 702 may desire to listen to primary content transcoded in 

30 accordance with guidance as discussed above. Browse process 1 106 in this case provides functions of 
voice browser server 123 (e.g., voice recognition for navigation commands) and cooperates with 
transcoder proxy server 125. 

The user may at any time direct I/O process 206 to convey a command (TO_LDD) to 
browse process 1 106 (step 1 128) for continuing the session on limited display device 702. The 

35 command may include DTMF signaling or the user's voice command, for example, "Send Me the 



73 



46806.00007 



Page" or another suitable command as in Table 2. Browse process 1 106 responds (step 1 130) with 
suitable derivative content (PAGE 10), for example, as discussed above with reference to step 780. 
The TO_LDD command may be recognized by speech recognition software that provides notice in 
any conventional manner to other processes of browse process 1 106 to switch from an audio user 

5 interface to a limited display device user interface. 

A method for applying guidance, according to various aspects of the present 
invention, includes in any order: (a) referring to constants, variables, attributes, subroutine names, 
and parameters each having a scope of reference; (b) processing a guidance language that comprises 
procedural, declarative, analytical, input, and output elements; (c) processing a guidance language in 

10 accordance with at least one of inheritance, overloaded references, and polymorphism; or (d) 
processing a guidance language that directs operations on a plurality of trees and/or subtrees 
including, for example, aggregating information from nodes of separately stored trees and/or 
subtrees. For example, method 1200 of FIG. 12 applies guidance to primary content to provide 
derivative content. In the illustrated implementation, language processing is accomplished by 

15 interpretation (as opposed to compilation). In an alternate implementation, compilation (e.g., 

reduction to a code similar in function to JAVA P-code) is accomplished according to well known 
principles of compilation and the teachings of the present invention. 

Guidance may be received 1202 in any conventional manner, for example, as a file 
having content conforming to the MASK markup language as discussed above (e.g.. Table 4 ). 

20 Guidance may be parsed 1204 to produce a tree (e.g., a hierarchical list, linked data 

structure, database, relational database, or a DOM as discussed above). Language processing may 
proceed in any conventional manner that provides scope of references as discussed above. For 
example, a hierarchical model may be traversed 1206 in a depth-first manner to identify a current 
guidance node for language processing. Completion of traversal may be followed by receiving 1202 

25 further guidance to be applied to the same derivative content or to fiirther derivative content. 

Completion may be indicated by arriving at the top node or a stop node of a hierarchical model 
whether or not all nodes have been traversed. By parsmg the guidance in a manner similar to parsing 
primary content, and skeletal descriptions as discussed above, a less complex transcoding process may 
result as compared to conventional transcoders. 

30 Language processing of guidance at the current guidance node may include detecting 

whether a procedural, declarative, analytical, input, or output operation is to be performed 1208. 
Operations at the current guidance node may specify any combination of procedural, declarative, 
analytical, input, or output operations, in which case blocks 1210-1216 may be executed in any order 
and perhaps multiple times for different purposes (e.g., evaluation of a string and string constant, 

35 declaration of a string variable, assignment of the result of analysis to the string variable, and 
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invoking the subroutine having a name that corresponds to the value of the declared variable, all 
expressed by MASK language elements at the current guidance node or subtree). For a procedural 
operation, a possibly different guidance node may be identified 1210 as current and interpretation may 
then continue from 1208. Such operations are analogous to operations that modify the content of a 

5 program counter in a conventional microprocessor. 

For a declarative operation, a value of a variable may be created and/or revised. For 
example, one or more properties at a node of a variable of the tree type may be assigned values 1212. 
The structure of a tree may be revised 1212, for example, by creating one or more nodes, moving, 
copying, and/or pruning one or more nodes. 

10 For an input operation, language processing may: (a) await user input in any 

conventional manner, (b) obtain user or system input from the operating system supporting at least a 
portion of method 1200, or (c) obtain user or system input from another operating system or 
application program that is coupled to the language processor via any conventional networking 
technology. After receipt of input (if any within a suitable time period), language processing may 

15 respond 1214 to such input to complete any related procedural, declarative, analytic, and/or output 
function. 

For an output operation, a value of a constant, variable, or parameter may be provided 
1216 as all or any portion of derivative content. Such derivative content may be made accessible in 
any conventional manner to a suitable presentation process or processes as discussed above. 

20 On completion of a declarative, analytical, input, or output operation, language 

processing may continue 1206 to another operation in sequence, to another operation by return from a 
subroutine (e.g., a recursive return) or to another node of the guidance tree. 

A process for transcoding, according to various aspects of the present invention, 
includes any process for providing derivative content according to one or more directives of a 

25 guidance language as defined herein. In one implementation, the execution of a directive may include 
performing declarative operations (e.g., managing storage space for variables), performing input 
operations (e.g., reading from one or more input buffers), evaluating an expression (e.g., calculating a 
value from constants and the values of declared variables), performing an assignment (e.g. storing a 
new value for a variable), providing output (e.g., writing one or more output buffers), and performing 

30 procedural aspects (e.g., managing the flow of program control). For example, language processing 
of an element having an attribute-value pair may include: declaration of a variable occurring in a first 
instance in a single tag, a start tag, or enclosure; acceptance of an input value that then participates in 
the calculation of a value to assign to the newly declared variable; providing as output a constant 
and/or variable value for text identified and/or determined by any proper language construct enclosed 

35 
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between the start tag and an end tag; and determining a next position in the guidance at which further 
interpretation will proceed. 

For example, transcode process 218 of FIG. 13 accesses guidance 230 and content 
400. Transcode process 218 provides derivative content 420 by applying directives of guidance 230 

5 to primary content 400 as discussed above. Operations as defined by the guidance language may be 
implemented using one or more processes and one or more data structures. Conventional 
programming and data structure design techniques may be used to implement transcode process 218 
as described herein. Processes may be performed by one or more processors. Data may be 
represented by data structures including string or tree structures. In one implementation tree 

10 structures comprise any conventional document object model as discussed above. Any conventional 
memory systems (e.g., pipelines, RAMs, disks) may be used to store and organize the program 
executable code and data structures including, for example, data bases, files, message queues, and 
regions of memory. For example, transcode process 218 includes two processes: parser 1302 and 
executive 1304. 

15 A parser, according to various aspects of the present invention, includes any process 

that analyzes the syntax and/or semantics of guidance to identify directives and data referred to by 
each directive. For example, parser 1302 identifies elements, tags, attribute-value pairs, and text as 
discussed above and in Table 15 to determine a sequence of directives with associated references. 
Directives and references are passed to executive 1304 via directive queue 1326. 

20 ' An executive, according to various aspects of the present invention, includes any 

process that receives a sequence of directives and associated references and performs operations to 
give effect to the directives. For example, executive 1304 receives directives via directive queue 1326 
and performs the operations discussed in Table 15. Parser 1302 and executive 1304 are preferably 
implemented as an interpreter. In an alternate implementation, parser 1302 may produce compiled 

25 code in place of directive queue 1326 and the functions of executive 1304 may be integrated in the 
compiled code for operation apart from the parser. Generally, executive 1306 provides derivative 
content 440 as a result of manipulating data (e.g., constants, variables, parameters, and scripts) in 
memory. Data manipulation includes, among other things^ declaring storage areas for variables and 
parameters, assigning calculated values to variables and parameters involved in expressions for 

30 assignment or comparison, passing variables into or out of subroutines as parameters, receiving data 
which may include guidance or content, and providing data as derivative content. 

Parser 1302 includes access and analyze guidance process 1306, access and analyze 
content process 1308, create and align skeletons process 1310, and get next directive and references 
process 1312. Access and analyze guidance process 1306 accesses guidance 230 (e.g., a guidance file 

35 or an include file) in a manner as discussed above, parses the guidance in accordance with the MASK 
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language, and prepares guidance tree 1322, representing the hierarchical nature of the MASK 
language elements as used in guidance 230. Preferably, guidance tree 1322 conforms to a 
conventional document object model as discussed above. In one implementation, each node of 
guidance tree 1322 corresponds to an element and is used to store information pertaining to the 
5 element. Access and analyze guidance process 1306 may perform the functions discussed above with 
reference to find guidance process 402. 

Further, access and analyze guidance process 1306 may review guidance tree 1322 as 
a whole and provide a hierarchical list of references to subroutines that are defined at any node in 
guidance tree 1322. During transcoding that began with a first guidance file, reference may be made 
10 to calculated script and to other files. Such additional guidance may be added to guidance tree 1322 
at any suitable node. Access and analyze guidance process 1306 may review the revised guidance 
tree 1322 in whole or in part and update the hierarchical list of references to subroutines. The 
hierarchical list of subroutines may be maintained at a suitable node of guidance tree 1322. 

Access and analyze content process 1308 accesses content 400 (e.g., primary content 
1 5 or other modules) in a manner as discussed above, parses the content accordance with a markup 
language (e.g., HTML, CSS, XML, XSL, XSLT, or and language compliant with SGML) and 
prepares content tree 1324, representing the hierarchical nature of the markup language elements as 
used in content 400. Preferably, content tree 1322 conforms to a conventional document object model 
as discussed above. In one implementation, each node of content tree 1322 corresponds to an element 
20 of the markup language of the content and is used to store information pertaining to the element. 

Access and analyze content process 1308 may perform the fimctions discussed above with reference 
to analyze process 406 and records 418, 419, and 420. 

Create and align skeletons process 1310 determines whether a skeleton has been 
defined in guidance tree 1322 and, if so, creates a corresponding representation fi*om content tree 
25 1324 and aligns corresponding nodes of the created representation and the guidance skeleton, as 

discussed above with reference to find model skeleton process 406, make PC skeleton process 410, 
align process 412, and records 415, 416, and 417. Indicia of alignment (e.g., records 417, or other 
associations of node identifiers) may be stored in guidance tree 1322 or content tree 1324. 

Get next directive and references process 1312 traverses nodes of guidance tree 1322 
30 in a depth first manner beginning at the root of the tree, maintains a pointer to the so-called current 
guidance node, and determines a next guidance node in accordance with directives discussed below 
with reference to procedural elements. For each node, get next directive and references process 1312 
prepares one or more entries and submits each entry in a suitable sequence to directive queue 1326. 
An entry may include suitable indicia corresponding to: (a) the directive at the current guidance node 

35 
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(e.g., the tag-identifier of the current element); (b) each symbol of each attribute-value pair (if any); 
and (c) enclosed text. 

Get next directive and references process 1312 may make any number of submissions 
in accordance with the current guidance node. More than one submission may be suitable for 

5 processing the declarative, expression evaluation, and assignment operations for each of several 

attribute-value pairs. Each such submission may result in intermediate calculations that provide one 
or more entries in control queue 1328. In other words, in one implementation, execute directive 
process 1340 plays a recursive and/or repetitive role in interpreting guidance at the current node. 
Some examples follow. A tag-identifier may be a reserved sjonbol (e.g., 'mask', or 'selection', etc.) 

10 easily identifiable without further processing; or a tag-identifier may be the name of a user-defined 

subroutine as discussed below. For each attribute-value pair, several operations including declaration, 
expression evaluation, and assignment may be submitted before the directive and references to these 
results are submitted. The enclosed text may refer to a variable (whose value is accessible only to 
executive 1304) that provides script (to be analyzed by parser 1302), for example, as discussed below 

1 5 with reference to the <eval> tag. 

Control queue 1328 provides results of intermediate calculations and control flow 
calculations accomplished by execute directive process 1340. Get next directive and references 
process 1312 may read control queue 1328 to determine, for example, the nature of a calculated tag- 
identifier, the calculated identifier of a referenced variable (e.g., a node identifier), or an intermediate 

20 result that may affect a future submission to directive queue 1326. Get next directive and references 
process 1312 may read control queue 1328 to determine a value for next guidance node so as to effect 
changes in control flow directed by procedural elements. 

Executive 1304 includes execute directive process 1340 and prepare output process 
1342. Execute directive process 1340 maintains current information (e.g., value and address or 

25 location) for declared variables, parameters, and subroutines. This information is maintained in 
separate stacks, one stack for variables, one for parameters and one for subroutines. Information 
regarding subroutines that is kept on the subroutine stack may include for each named subroutine, a 
node identifier in guidance tree 1322 where the declaration of the subroutine (e.g., elements, 
directives, script) may be found. By maintaining subroutine information, execute directive process 

30 1340 may determine values for next guidance node as discussed above. By maintaining variables in a 
stack, overloaded names for variables may be resolved to the most local instance in a conventional 
manner. Similarly, by maintaining parameters in a stack, overloaded names for parameters may be 
resolved to the most local instance in a conventional manner. An overloaded name is a name that is 
declared at more than one nested level in the guidance tree. For example, a variable named "x" may 

35 be declared as a variable at an outermost level and also declared inside a nested element (e.g., a pair 
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of <subroutine> tags or other paired tags). Semantically^ the innermost declaration of the reference is 
generally appropriate, as discussed above with reference to block structured programming. By using 
stacks for variable, parameter, and subroutine information, execute directive process 1348 may 
discard variables, parameters, and subroutine information when no longer appropriate to be used. For 
5 example, the local variables declared within an instance of a subroutine may be discarded (and the 
memory space reused) when that instance is complete (e.g., a return from subroutine has been 
effected). The variable stack may be implemented as a stack for string variables and a stack for tree 
variables. 

The top tree on the variable stack may be considered a default for all operations that 
10 do not otherwise refer explicitly to a tree that is implicit in the operation. In an alternate 

implementation, string variables and tree variables are stored on a heap and stack frames include 
pointers to the heap. 

Subroutine information (e.g., name and node number) for all subroutine declarations 
at the current level may be posted in a frame on the subroutine stack. A <mask>, <include>, 

15 <modules>, or <eval> element may be scanned for subroutine declarations before further processing. 
When a <subroutine> element is called, a new frame is created on the subroutine stack. Subroutine 
information at the first level within this subroutine element is posted in the new frame and discarded 
when the subroutine returns control to the calling context. 

Execute directive process 1340 reads submissions from directive queue 1326 and 

20 performs operations on references as specified in each submission. Execution status, errors, and 
intermediate results are submitted to control queue 1328 as discussed above. Each reference may 
identify a variable on the variable stack, a parameter on the parameter stack, a subroutine entry on the 
subroutine stack, a node of guidance tree 1322 (e.g., as the source of a constant), a node of content 
tree 1324 (e.g., for text to copy to derivative content), or a node of output tree 1350. 

25 Output tree 1350 is initially blank having neither structure nor contents. As directives 

from directive queue 1326 are executed, nodes and node contents are added to output tree 1350. 
Nodes may be revised and deleted also. As can be readily appreciated, since variables may be of tree 
type, portions of output tree 1350 may be easily declared, evaluated, and assigned with reference to, 
among other things, content tree 1324 (e.g., text and subtrees of primary content), variable stack 1348 

30 (e.g., text and subtrees resulting from expression evaluation and assignment), and guidance tree 1322 
(e.g., constants predefined as text and subtrees). Output tree 1350 preferably conforms to 
conventional a document object model. 

Prepare output process 1342 reads output tree 1350 and prepares derivative content in 
any suitable format, for example, a markup language (e.g., HTML with CSS, XML with XSL and/or 

35 XSLT tags, or any SGML). 
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The processes described herein may be implemented according to the principles and 
techniques described in conventional texts on compilers, interpreters, and execution environments, 
including, for example, "Compilers ~ Principals Techniques and Tools" by R. Fred Aho, Ravi Sethi, 
and Jeffery Ullman (1988 Edison Wesley) and references cited therein; and "Lex and Yacc" by Jung 
5 Levine, Tommy Mason, and Doug Brown (1995 O'Reiley) and references cited therein. 

Each element described in Table 4 may enclose any number of proper elements of the 
same or different types to a level of nesting limited by the implementation, not by the MASK 
language. 

In addition to the elements defined above, a process for transcoding according to 
10 various aspects of the present invention may be implemented to pass over elements that are not 

defined by the MASK language and not defined as user-defined subroutines via the MASK language. 
These elements may be treated as text for output as discussed above. For example, content which 
includes HTML elements, XML elements, and other SGML elements (CSS, XSL, MathML, XSLT, 
etc.) may be included in guidance (or included appended for example with the <include> element) at 
1 5 any position where a literal tree may occur (e.g., in an enclosure of a <def^ar> element). A process 
for transcoding in an alternate implementation processes <xslt:template> and <xslt:apply-template> 
elements in accordance with XSLT. 

The following examples illustrate techniques supported by transcode process 218 as 
discussed above. 
20 EXAMPLE 1: 

<mask version=l,0> 

<!-- A macro is declared that inserts a cgiheader having two newlines — > 
<defvar name= cgiheader value= "Content-type: text/html 



25 "/> 

<!-- The macro is called as follows: -> 
<eval><variable name=cgiheaderxy eval> 
<l mask> 

30 

EXAMPLE 2: 

<mask version=LO> 

<!-- A user name and birth date are sent to a cgi program on another server. — > 
<module url=http://whoswho.net/cgi-bin/users.cgi> 
35 <postfield>usemame="John Doe"&birthdate="10Augl965"</ postfield> 
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</ module> 
<l mask> 



EXAMPLE 3: 
5 <niask version=l .0> 

<!— A tree is sent to a server capable of receiving SOAP. — > 
<defVar name=messagel> 

<usemame>John Doe</usemame> 
<birthdate> 1 0 Aug 1 965</birthdate> 
10 </defvar> 

<module url-http://whoswho.net/registrar> 

<postfleld><variable name=messagel /></ postfield> 
<!— Results of SOAP calJ can go to the output tree ~> 
<selection><node id= 0 /></ selection> 
15 </module> 
</ mask> 



EXAMPLE 4: 

<mask version=l .0> 
20 <!" Literal values are passed by value to a subroutine. — > 

<!— The call to send some email — > 

<SendEmail address="wbachand@ssd,com" subject="ALERT'*> 
The Dow Jones Average has dropped 
25 over 200 points in today*s trading! 

</ SendEmail> 

<!-- The subroutine declaration. --> 
<subroutine name=SendEmail> 
30 <module url="http://globalmailerorg.com/cgi-bin/sendmai].cgi" /> 

<postfield> 

&address=<parameters select=@address /> 
&subject=<parameters select=@subject/> 
&mail=<parameters select^* /> 
35 </postfield> 
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</ module> 
</ subroutme> 
</ mask> 

5 EXAMPLES: 

<mask version=l .0> 

<!— The values of variables are passed by value to a subroutine, ~> 

<defvar name=to value="wbachand@ssd,com" /> 
1 0 <defvar name=re value="ALERT" > 

<!— The call to send some email — > 

<!- Because the following use of "to" and "re" can be recognized as variable names, the 
values of the parameters are set to the values of the "to" and "re" variables. — > 
<SendEmail address=to subject=re > 
1 5 The Dow Jones Average has dropped 

over 200 points in today's trading! 
</ SendEmail> 

<!— The subroutine declaration. ~> 
20 <subroutine name=SendEmail> 

<module url="http://globalmailerorg.com/cgi-bin/sendmail.cgi" /> 
<postfield> 

&address=<parameters select=@address /> 
&subject=<parameters select=@subject /> 
25 &mail=<parameters select^* /> 

</ postfield> 
<l module> 
</ subroutine> 
</ mask> 



30 



EXAMPLE 6: 



<mask version=l .0> 

<!-- An attribute and an enclosure are passed by reference.— > 
35 <!-- Add 2 to an attribute and 3 to an enclosed variable. ~> 
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<defVar name=A value="2" /> 
<defVar name=B value="3" /> 
<!— Now A is 2 and B is 3. — > 
<IncrementBoth first= "A">"B"<y IncrementBoth> 
5 <!- Now A is 6 and B is 8. -> 

<!— The subroutine declaration. ~> 

<subroutine name=IncrementBoth> 

<defvar name^PointToAttribxparameters seleclr=@first /></ defvar> 
10 <defVar name=PointToEnclo><parameters select=* /></ defvar> 

<assign name=$PointToAttrib type=add>4</ assign> 
<assign name=$PointToEncIo type=add>5</ assign> 

</ subroutine> 

</ mask> 

15 

EXAMPLE 7: 

<mask version=l .0> 

<!-- Create a composite tree for content and an analysis of the content. — > 

<defvar name=CompositeTree></ defv^ar> 
20 <module url=#CompositeTree> 

<description> 

<node id=0> 

<header> 

<module url=http://www.w3c.com> 
25 <selection><node id=0 l><l selection> 

<l module> 

</ header> 

</ node> 

</ description> 
30 </moduIe> 

<!- check if XML is in the page and report -> 

<regex pattem="[xX] [mM] [IL]" option=g > 
<variable name=CompositeTree /> 
35 </regex> 
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<]-- If the string XML occurred at position 23 and the string Xml occurred at 
position 46, a match node is inserted into the output tree as follows : 



10 



15 



<match size=2> 
<pos> 



<arg>23</ arg> 
<arg>26</ arg> 



</ pos> 
< pos> 



<arg>46</ arg> 
<arg>49</ arg> 



</ pos> 
</ match> 



— > 

</ mask> 
EXAMPLE 8: 

<mask version=l .0> 



20 



25 



30 



35 



<!— Somewhere in the middle of a purchase, tax must be added to the offered price for a 
product. The transcoding proxy server can calculate the state and federal tax using 
subroutines. Subroutines may be written before or after the call. Here they are declared after 
the call. — xassign name=product>jewelry</assign> 
<assign name=offer>$20,000</ assign> 

<!— TotalTax is set to sum of sales and luxury taxes — > 
<tax><state s=CA><sales p=$product o=$offer /></ sales></state> 
<!— while in the context of tax, output the price including tax — > 
<!— HTML tags will pass through as text and appear in the output... -> 
<p>Your total cost of the <variable name=product /> including tax is 
<expr eval=plus> 

<arg>$offer</ arg> 
<arg>$TotalTax</ arg> 
</ expr> 

.<p> 
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</tax> 

<!-- Use federal luxury tax in addition to state luxury tax if any. — > 
<subroutine name=tax> 
5 <defvar name=TotalTax value=0 /> 

<subroutine name=luxury> 

<if><cond eval=same> 
<arg>$p< arg> 
<arg>jewelry</ arg><y cond> 
1 0 <then><assign name=t value =$o> 

<expr eval=times> 
<arg>t</ arg> 

<arg>0.04<y argx/ Qxprxf then> 
<else><assign name=t value=0>*^ else> 
1 5 <assign name=TotalTax type=plus> 

<variable name=t /></ assignx/ if> 
</ subroutine name=state s=twoletter> 
<subroutine name=sales> 
<i£><cond eval=same> 
20 <arg>twoletter</ arg> 

<arg>CA</ arg></cond> 

<then> 

<assign nanie=t value=$o /> 
<expr eval=times> 
25 <arg>t</ arg> 

<arg>l .05</ argX/ exprx/ then> 
<else><assign name=t value=0></ else> 
<assign name=TotalTax type=plus> 

<variable name=t /></ assignx/ iC> 
30 <luxury /> 

</ subroutine> 
</ subroutine> 
<ysubroutine> 
</ mask> 

35 
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EXAMPLE 9: 

<mask version=l .0> 

<!-- Pass a subtree into a subroutine for processing ~> 
<subroutine name-abinc > 
5 <parameters selectr=@tree f> 

<module name=#$tree> 
<description> 

<parameters select=* t> 
<J description> 
10 < modulo 

</ subroutine> 

<defVar name=LocalTree> 
<AAA>12</AAA> 
15 <BBB>16</ BBB></ defVar> 

<!-- Increment AAA and BBB -> 
<abinc tree="LocaiTree" /> 
<!" AAA is now 13 and BBB is now 17 --> 
</ mask> 

20 

EXAMPLE 10: 

<mask version^l .0> 

<!— Assign a retum value from a subroutine to a variable — > 
<!- Subroutine declaration ~> 
25 <subroutine name=visited>Oregon and Califomia</ subroutine> 

<!— Call the subroutine as a function ~> 

<defvar name=states><visited /> =</ defvar> 
<!-- Variable "states" now has the value "Oregon and California" — > 
</ mask> 

30 

While for the sake of clarity and ease of description, several specific embodiments of 
the invention have been described; the scope of the invention is intended to be measured by the claims 
as set forth below. The description is not intended to be exhaustive or to limit the invention to the 
35 form disclosed. Other implementations of the invention will be apparent in light of the disclosure and 
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practice of the invention to one of ordinary skill in the art to which the invention applies. 
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